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DWF4 POLYNUCLEOTIDES, POLYPEPTIDES AND USES THEREOF 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is related to provisional patent applications serial nos. 
60/119,657, filed February 11, 1999 and 60/119,658, filed February 11, 1999, from 
which priority is claimed under 35 USC §1 19(e)(1) and which appUcations are 
incorporated herein by reference in their entireties. 

10 

TECHNICAL FIELD 
The present invention relates to novel polynucleotides isolated from dwarf 
plants. The dwf4 polynucleotides encode all, or a portion of, a DWF4 polypeptide, a 
cytochrome P450 enzyme that mediates multiple steps in synthesis of 
1 5 brassinosteroids. The present invention also relates to isolated polynucleotides that 
encode regulatory regions of dwf4. Uses of the dwf4 polypeptides and 
polynucleotides are also disclosed. 

BACKGROUND 

20 Plant growth is accompHshed by orderly cell division and tightly regulated cell 

expansion. In plants, the contribution of cell expansion to growth is of much greater 
significance than in most other organisms; all plant organs owe their final size to a 
period of significant cell elongation, which usually follows active cell division. 
Further, the sessile nature of plants requires that they make fme but responsive 

25 adjustments in growth to survive harsh environmental conditions and to optimize their 
use of limited resources (Trewavas (1986) "Resource allocation under poor growth 
conditions: A major role for growth substances in developmental plasticity" In 
Plasticity in Plants, D.H. Jennings and A. J. Trewavas, eds (Cambridge, UK: Company 
of Biologists Ltd.), pp. 31-76). 

30 In Arabidopsis, cell elongation is largely responsible for hypocotyl growth in 

germinating seedlings and extension of inflorescences (bolting) at the end of 
vegetative growth. Coordinate control of plant growth is regulated by both external 
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stimuli and internal mechanisms. Of the external signals^ the most obvious is light 
(Deng, X.-W. (1994) Cell 76:423-426). Light inhibits hypocotyl elongation and 
promotes cotyledon expansion and leaf development in seedlings, and photoperiod is 
crucial for flower initiation in a large number of species. 
5 The internal components of plant signaling are generally mediated by 

chemical growth regulators (phytohormones; reviewed in Klee, H., and Estelle, M. 
(1991) Annu, Rev. Plant Physiol Plant MoL Biol 42:529-551 ). Thus, plant growth 
in response to environmental factors is modulated by plant hormones acting alone or 
in concert (Evans "Functions of hormones at the cellular level of organization" In 

10 Hormonal Regulation of Plant Physiology, T.K. Scott, ed (Berlin: Springer- Verlag), 
pp. 23-79), and growth depends on regulated cellular events, such as division, 
elongation, and differentiation. 

Gibberellic acid (GA) and cytokinins promote flowering; in addition, GA 
stimulates stem elongation, whereas cytokinins have the opposite effect, reducing 

15 apical dominance by stimulating increased axillary shoot formation. Conversely, 
auxins promote apical dominance and stimulate elongation by a process postulated to 
require acidification of the cell wall by a K"-dependent H^-pumping ATPase (Rayle, 
D.L., and Cleland, R.E. (1977) Curr, Top, Dev. Biol 11:187-214). 

In addition to the classic hormones, such as auxin and gibberelUc acid (GA), 

20 brassinosteroids (BRs) have been discovered to be important in growth promotion 
(reviewed in Clouse (1996) Plant J. 10:1-8). The most recently discovered class of 
plant growth substances, the BRs, has been to date the least studied; however, rapid 
progress toward understanding BR biosynthesis and regulation is now being made 
(Yokota, T. (1997) Trends Plant Sol 2:137-143), The term BRs collectively refers to 

25 the growth-promoting steroids found in plants (Grove et al. (1979) Nature 

281 :216-217). They are structurally very similar to the molting hormones of insects, 
ecdy steroids (Richter and Koolman (1991) "Antiecdy steroid effects of 
brassinosteroids in insects" in Brassinosteroids: Chemistry, Bioactivity, and 
AppUcations, H.G. Cutler, T. Yokota, and G. Adam, eds (Washington, DC: American 

30 Chemical Society), pp. 265-279), but active BRs have unique structural features. As 
shown in Figure 1, a 6-oxolactone or 7-oxalactone in the B ring, 5 a hydrogen, and 
multiple hydroxylations at four different positions with specific stereochemistry have 
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been proposed as an essential configuration for BRs (reviewed in Marquardt and 
Adam (1991) "Recent advances in brassinosteroid research" in Chemistry of Plant 
Protection, W. Ebing, ed (Berlin: Springer-Verlag), pp. 103-139). Among >40 
naturally occurring BRs, brassinolide (BL; 2a, 3a, 22(R), 23(R)-tetrahydroxy-24(S)- 
5 methyl-B-homo-7-oxa-5a-cholestan-6-one) has been shown to be the most 

biologically active (reviewed in Mandava (1988) Annu. Rev. Plant Physiol Plant 
Mol Biol 39:23-52 ). As a major biological effect, BRs stimulate longitudinal 
growth of young tissues via cell elongation and cell division (reviewed in Clouse 
(1996), supra\ Fujioka and Sakurai (1997a) Nat Prod. Rep, 14:1-10). 
1 0 Elucidating the BR biosynthetic pathways has been a major area of recent 

interest. Biochemical analyses have been used to elucidate the BR biosynthetic 
pathway (Fujioka et al. (1996) Plant Cell Physiol 37:1201-1203; Choi et al. (1997), 
Phytochemistry 44:609-613), and mutational analyses are being used to confirm this 
pathway. Similar to the biosynthetic pathways of the human steroid hormones and 
15 insect ecdysteroids (Rees (1985) "Biosynthesis of ecdysone" in Comprehensive Insect 
Physiology, Biochemistry and Pharmacology, G. A. Kerkut and L.L Gilbert, eds 
(Oxford, UK: Pergamon Press), pp. 249-293; Granner, D,K. (1996) "Homiones of the 
gonads" in Harper's Biochemistry, R.K. Murray, D.K. Granner, P. A. Mayes, and 
V.W. Rodwell, eds (Stamford, CT: Appleton and Lange Press), pp. 566-580), BRs are 
20 synthesized via multiple parallel pathways (Fujioka et al. (1996) Plant Cell Physiol 
37:1201-1203; Choi et al. (1997), supra). Starting from the initial precursor, 
campesterol (CR), the BR intermediates undergo a series of hydroxylations, 
reductions, an epimerization, and a Baeyer-Villigerutype oxidation leading to the 
most oxidized form, BL (Fujioka and Sakurai (1997b) Physiol Plant. 100:710-715; 
25 Figure 1). Castasterone (CS) oxidation, the last step in BR biosynthesis, is not found 
in some species, such as mung bean. In that case, CS plays a role as the major BR 
rather than BL (Yokota et al. (1991) "MetaboUsm and biosynthesis of 
brassinosteroids" in Brassinosteroids: Chemistry, Bioactivity, and Apphcation, H.G. 
Cutler, T. Yokota, and G. Adam, eds (Washington, DC: American Chemical Society), 
30 pp. 86-96). Traditionally, BR biosynthetic pathways have been elucidated by feeding 
deuterio-labeled intermediates to BR-producing cell lines of Madagascar periwinkle 
(Sakurai and Fujioka (1996) "Catharanthus roseus (Vinca rosea): In vitro production 
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of brassinosteroids" in Biotechnology in Agriculture and Forestry, Y.P.S. Bajaj, ed 
(Berlin: Springer-Verlag), pp. 87-96.). The present model, including parallel 
branched pathways and early and late C-6 oxidation pathways, was established using 
these feeding studies (Fujioka and Sakurai (1997a), supra, Fujioka and Sakurai 
5 (1997b), supra; Sakurai and Fujioka (1997) Biosci. Biotechnol Biochem, 
61:757-762). 

Although the brassinosteriod system is a less well understood class of plant 
growth substances (BRs; Mitchell, et al. (1970) Nature 225:1065-1066; Grove et al. 
(1979) Nature 281:216-217; Mandava, N.B. (1988) Annu. Rev, Plant Physiol Plant 
10 MoL Biol 39:23-52), several such compounds have been identified and are known to 
effect elongation of cells in various plant tissues, their biosynthesis, regulation, and 
mechanism of action have only recently begun to be elucidated (reviewed in Clouse, 
S.D. (1996) Plant 1 10:1-8; Fujioka, S., and Sakurai, A. (1997) Physiol Plant 
100:710-715). 

1 5 Several types of dwarf or dwarflike mutants have been described in 

Arabidopsis. A number of mutations have been identified that affect either 
light-dependent (cop, det, md fusca [fus; another group of mutants with some 
members perturbed in hght-regulated growth]) or hormone signaling {axr2) pathways 
and whose pleiotropic phenotypes include defects in cell elongation. The majority of 

20 these mutants also have other alterations in their phenotypes. At least five GA 
mutants have been described as being reduced in stature (Koomneef and Van der 
Veen (1980) Theor. Appl Genet. 58:257-263). GA biosynthetic mutants may also 
have no or defective flower development and are marked by an absence of viable 
pollen. Reduced levels of endogenous gibberellins are also a characteristic (Barendse 

25 et al.(1986) Physiol Plant 67:315-319; Talon et al. (1990) Proc. Natl Acad Set 
USA 87:7983-7987), and their phenotype can be nearly restored to that of the wild 
type by the addition of exogenous GA. (Koomneef and Van der Veen (1980) Theor. 
Appl Genet 58:257-263). 

Another hormone mutation, auxin resistant! (axr2), results in plants with a 

30 dwarf phenotype both in the Ught and in darkness as well as increased resistance to 
high levels of auxin, ethylene, and abscisic acid (Timpte et al. (1992) Planta 
188:271-278). An interesting relationship exists between Hght regulation and 
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cytokinin levels. Arabidopsis seedlings grown in the dark in the presence of 
cytokinins have open cotyledons, initiate chloroplast differentiation and leaf 
development, and activate transcription from the chlorophyll a/h binding protein gene 
{CAB) promoter. Importantly, they also display a cytokinin dose-dependent dwarf 
5 phenotype. 

Dwarf Arabidopsis mutants that are rescued by addition of BRs have also been 
described (Kauschmann et al. (1996) Plant J. 9:701-713; Li et al. (1996) Science 
272:398-401; Szekeres et al. (1996) Cell 85:171-182; Azpiroz et al. (1998) Plant Cell 
10:219-230), including the following three mutants: dwarfl {dwfl\ Kauschmann et al. 

10 (1996) Plant /. 9:701-713), constitutive photomorphogenesis and dwarfism (cpd; 
Szekeres et al. (1996) Cell 85:171-182), and det2 (Li et al. (1996) Science 
272:398-401). These mutants have been shown to be defective in steroid 
biosynthesis. DWFl (Feldmann et al. (1989) Science 243:1351-1354) was cloned 
first (GenBank accession number U12400). Takahashi et al. (1995) Genes Dev. 

15 9:97-107 hypothesized that DWFl, which they isolated with an allele of dwfl, 

referred to as diminutol (diml), contains a potential nuclear targeting signal, which 
may confer a regulatory function to the protein. However, Mushegian and Koonin 
(1995) Protein Set 4:1243-1244 indicated that DWFl displays limited homology with 
flavin adenine dinucleotide (FAD)independent oxidoreductase, suggesting an 

20 enzymatic function in BR biosynthesis. According to Kauschmaim et al. (1996), 
supra (dwfl-6 described as cabbage 1 [cbbl]), dwfl mutants were rescued by 
exogenous application of BRs. 

DET2 was shown to encode a putative steroid 5a-reductase, mediating an 
early step in BR biosynthesis (Li et al. (1996), supra , Li et al. (1997) Proc. Natl. 

25 Acad Sci. USA 94:3554-3559; Fujioka et al. (1997) Plant Cell 9:1951-1962; Figure 
1). Moreover, detl and det2 have a decreased requirement for cytokinins in tissue 
culture and appear to be saturated for a cytokinin-dependent delay in senescence 
(Chory Gt dil (1994) Plant PhysioL 104:339-347). CPD has been proposed to be a 
novel cytochrome P450 (CYP90A1; Szekeres et al. (1996), supra), encoding a 

30 putative 23a-hydroxylase that acts in BR biosynthesis. The range of phenotypes in 
the deetiolated (det) and constitutive photomorphogenic (cop) light-regulatory 
mutants is broad. Mutations mDETl, COPl, COPS, C0P9, COP 10, and COP 11 
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result in constitutive derepression of substantial portions of the photomoiphogenic 
program (Chory, et al. (1989b) Cell 58:991-999; Deng, X.-W., and Quail, P.H. (1992) 
Plant J. 2:83-95; Wei, K, and Deng, X.-W. (1992) Plant Cell 4:1507-1518; Wei et al 
(1994) Plant Cell 6:629-643), whereas mutations in C0P4 seem to affect only 
5 morphology and gene expression (Hou et al. (1993) Plant Cell 5:329-339). The only 
invariant phenotype in this class of hght-regulatory mutants is a substantial reduction 
in height in both light and darkness. 

There are additional dwarfs that are insensitive to one of these hormones, such 
as bri {brassinosteroid insensitive', Clouse et al. (1996) Plant Physiol 111 :671-678; 

10 Li and Chory (1997) Cell 90:929-938), gai (gibberellic acid insensitive; Koomneef et 
al. (1985) Physiol. Plant, 65:33-39), and axr2 (auxin resistant!; Timpte et al, (1994) 
Genetics 138:1239-1249). Clouse et al. (1996), supra isolated bri by screening ethyl 
methanesulfonate-mutagenized populations for mutants whose root growth is not 
retarded at inhibitory concentrations of BR. Thus, the BRI protein is proposed to be 

15 involved in BR signal perception or transduction (Clouse (1996), supra). 

Kauschmann et al (1996), supra described a phenotypically similar mutant cbb2 that 
maps to the same location. In addition, the dwf2 alleles possess a phenotype similar 
to bri and map to the same region (Feldmann and Azpiroz (1994) ''dwarf {dwf) and 
twisted dwarf (twd)'' in Arabidopsis: An Atlas of Morphology and Development, J. 

20 Bowman, ed (New York: Springer- Verlag), pp. 82-85), It seems likely that all of the 
BR-insensitive dwarf mutants described to date are allehc. Recently, 57f/has been 
cloned and shown to encode a leucine-rich-repeat receptor kinase, suggesting a role in 
the BR signal transduction pathway (Li and Chory (1997), supra). 

Mutants defective in BR biosynthesis have also been isolated in other plant 

25 species. Bishop et al. (1996) Plant Cell 8:959-969 isolated a tomato dwarf mutmt by 
transposon tagging. The tomato Dwarf gene encodes a pioneering member of the 
CYP85 family, and it appears to be involved in BR biosynthesis. In addition, Nomura 
et al. (1997) Plant Physiol 1 13:31-37 reported that the Ika and Ikb mutants in garden 
pea are deficient in BR biosynthesis {Ikb) or perception {Ika). 

30 Currently, little is known about the downstream events that occur in response 

to these signals and thereby directly control cell size. This is because the biochemical 
and cell biological processes involved have thus far been difficult to address. In 
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addition, there is little information about the integration of regulatory signals 
converging at the cell from different signaling pathways and the ways they are 
coordinately controlled. In particular, the interaction of Ught and hormones in the 
control of cell elongation is not clear. Thus, there remains a need for the 
5 identification and characterization of additional mutants and polypeptides encoded 
thereby involved in these pathways of plant growth. 

SUMMARY OF THE INVENTION 
In one aspect the invention includes an isolated dwf4 polynucleotide 

10 comprising an open reading frame that encodes a polypeptide comprising (i) a 
sequence having greater than 43% identity to the amino acid sequence of 
SEQUENCE ID N0:2; (ii) a sequence comprising at least about 10 contiguous amino 
acids that have greater than 43% identity to 10 contiguous amino acids of 
SEQUENCE ID N0:2, or a complement or reverse complement of said 

1 5 polynucleotide. In certain embodiments, the polynucleotide will have at least 1Q% 
identity to the DWF4 polypeptide-coding region of SEQ ID N0:1 or to complements 
and reverse complements of this region. In further embodiments, the isolated dwf4 
polynucleotide comprises the nucleotide sequence of SEQ ID N0:1, complements and 
reverse complements thereof. The polynucleotide may also comprise at least 30 

20 consecutive nucleotides of SEQ ID NO : 1 . 

In another aspect, the invention includes an isolated dwf4 polynucleotide 
comprising (i) a sequence having at least 50% identity to SEQ ID N0:1, complements 
and reverse complements thereof or (ii) a sequence comprising at least about 15 
contiguous nucleotides that has at least 50%o identity to SEQ ID NO:l, complements 

25 and reverse complements thereof. In certain embodiments, the isolated dwf4 

polynucleotide has at least 50% identity to the DWF4 polypeptide-coding region of 
SEQ ID NO:l, complements and reverse complements thereof. In ftirther 
embodiments, the isolated dwf4 polynucleotides described herein comprise the 
nucleotide sequence of SEQ ID N0:1, complements and reverse complements thereof 

30 or nucleotide sequences comprising at least 30 consecutive nucleotides of SEQ ID 
NO: 1 . Any of the dwf4 polynucleotides described herein may be genomic DNA and 
may include introns. Further, in other embodiments, the d'wf4 polynucleotide includes 
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a dwf4 control control element comprising a polynucleotide selected from the group 
consisting of (i) a sequence having at least 50% identity to nucleotides 1 to 3202 of 
SEQ ID NO: 1 ; (ii) a fragment of (i) which includes a dwf4 control element; and (iii) 
complements and reverse complements of (i) or (ii). In still further embodiments, the 
5 polynucleotide includes a d-wf4 control element comprising a polynucleotide selected 
from the group consisting of (i) a sequence having at least 50% identity to nucleotides 
61 11 to 6468 corresponding to the 3' UTR of SEQ ID N0:1; (ii) a fragment of (i) 
v^hich includes a dwf4 3' UTR; and (iii) complements and reverse complements of (i) 
or (ii). In certain embodiments, the polynucleotide includes a dwf4 polynucleotide 

10 selected from the group consisting of (i) a sequence having at least 50% identity to the 
sequences corresponding to the introns of SEQ ID N0:1; (ii) a fragment of (i) which 
includes a dwf4 intro; and (iii) complements and reverse complements of (i) and (ii). 
Introns are found, for example, in the following regions: nucleotides 3424 to 3503 of 
SEQ ID NOrl; nucleotides 3829 to 3913 of SEQ ID N0:1; nucleotides 4067 to 4164 

15 of SEQ ID N0:1; nucleotides 4480 to 4531 of SEQ ID N0:1; nucleotides 4725 to 
4815 of SEQ ID N0:1; nucleotides 4895 to 5000 of SEQ ID N0:1; and nucleotides 
5 1 1 1 to 5864 of SEQ ID NO: 1 . 54. In still further embodiments, any of the 
polynucleotides described herein can operably linked to a nucleic acid molecule 
encoding a heterologous polypeptide {e.g., a cytochrome P450 polypeptide), for 

20 example, as a chimeric polynucleotide. 

In another aspect, the invention includes recombinant vectors comprising (i) 
one or more of the polynucleotides described above; and (ii) control elements 
operably linked to the one or more polynucleotides, whereby a coding sequence 
within said polynucleotide can be transcribed and translated in a host cell. In certain 

25 embodiments, the recombinant vector comprises (a) any of the polynucleotides which 
include a dwf4 control element described above (e.g., promoter or intron); and (b) a 
nucleic acid molecule comprising a coding sequence operably linked to the dwf4 
control element. 

Host cells comprising and/or transformed with any of the recombinant vectors 
30 described herein are also provided. In certain embodiments, the host cells are cultured 
ex vivo while in other embodiments, the dwf4 polynucleotide is provided the host cell 
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in vivo. In certain embodiments the DWF4 polypeptide is provided in amounts such 
that a plant is regenerated. 

In another aspect, the present invention includes a method of modulating a 
DWF4 polypeptide comprising the following steps: (a) providing a host cell as 
5 described herein; and (b) culturing said host cell under conditions whereby the dwf4 
polynucleotide included in the host cell is transcribed. In certain embodiments, the 
dwf4 polynucleotide is overexpressed. Alternatively, in other embodiments, the 
polynucleotide included in the host cell inhibits expression of dwf4. 

In yet another aspect, the present invention includes a transgenic plant 

10 comprising any of the recombinant vectors described herein. 

In yet another aspect, the invention includes a method of producing a 
recombinant polypeptide comprising the following steps: (a) providing a host cell as 
described herein; and (b) culturing said host cell under conditions whereby the 
recombinant polypeptide encoded by the coding sequence present in said recombinant 

1 5 vector is expressed. 

In a still further aspect, the invention includes a method of producing a 
transgenic plant comprising the steps of (a) introducing a polynucleotide described 
herein into a plant cell to produce a transformed plant cell; and (b) producing a 
transgenic plant from the transformed plant cell. 

20 Methods for producing a transgenic plant having an ahered phenotype relative 

to the wild-type plant comprising the following steps: introducing at least one 
polynucleotide described herein into a plant cell; and producing a transgenic plant 
from the plant cell, said transgenic plant having an altered phenotype relative to the 
wild-type plant are also included in the present invention. The altered phenotype 

25 includes altered morphological appearance and altered biochemical activity, for 
example, altered (reduced or increased) cell length in any cell or tissue, altered 
(extended or decreased) periods of flowering, altered (increased or decreased) 
branching, altered (increased or decreased) seed production, altered (increased or 
decreased) leaf size, altered (elongated or shortened) hypocotyls, altered (increased or 

30 decreased) plant height, altered heme-thiolate enzyme activity, altered 
monooxygenase activity, altered 22(x-hydroxylase activity, regulation of 
brassinosteriod synthesis, regulation of gibbereUic acid, regulation of cytokinins, 
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regulation of auxins, altered resistance to plant pathogens, altered growth at low 
temperatures, altered growth in dark conditions and altered sterol composition. In 
certain embodiments, the at least one polynucleotide is operably linked to a promoter 
selected jfrom the group consisting of a tissue-specific promoter, an inducible 
promoter or a constitutive promoter. The polynucleotide can be overexpressed or it 
can inhibit expression oidwf4. In a still further embodiment, at least two 
polynucleotides are introduced into the plant cell. Each polynucleotide is operably 
hnked to a different tissue-specific promoter such that one polynucleotide is 
overexpressed while the other inhibits expression of dwf4. 

In yet another aspect, the invention includes a method for altering the 
biochemical activity of a cell comprising the following steps: introducing at least one 
polynucleotide described herein; and culturing the cell under conditions such that the 
biochemical activity of the cell is altered. Biochemical activity includes, for example, 
altered heme-thiolate enzyme activity, altered monooxygenase activity, altered 22a- 
hydroxylase activity, regulation of gibbereUic acid, regulation of cytokinins, 
regulation of auxins, and altered sterol composition. In certain embodiments, the cell 
is cultured ex vivo. In other embodiments, the dwf4 polynucleotide is provided to the 
cell in vivo. In still other embodiments, more than one dwf4 polynucleotides are 
provided to the cell. 

In yet another aspect, the invention includes a method for regulating the cell 
cycle of a plant cell comprising the following steps providing a dwf4 polynucleotide 
to a plant cell; and expressing the dwf4 polynucleotide to provide a DWF4 
polypeptide, wherein the DWF4 polypeptide is provided in amounts such that cell 
cycling is regulated. In certain embodiments, the plant cell is provided in vitro and is 
cultured under conditions suitable for providing the DWF4 polypeptide. In still other 
embodiments, the dwf4 polynucleotide is provided in vivo. 

In yet another aspect, the invention includes an isolated DWF4 polypeptide 
comprising (i) a sequence having greater than 43% identity to SEQ ID N0:2 or (ii) 
fragments of (i) that confer a DWF4 phenotype when expressed in a host organism. 
In certain embodiments, the isolated DWF4 polypeptide comprises the amino acid 
sequence of SEQ ID N0:2. In certain embodiments, the invention includes a chimeric 
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polypeptide comprising a DWF4 polypeptide as described herein and a heterologous 
polypeptide, for example a cytochrome P450 polypeptide. 

Any of the polynucleotides or polypeptides described herein can be used in 
diagnostic assays; to generate antibodies. Further, the antibodies and fragments 
5 thereof can also be used in diagnostic assays, to produce immunogenic compositions 
or the like. 

These and other objects, aspects, embodiments and advantages of the present 
invention will readily occur to those of ordinary skill in the art in view of the 
disclosure herein. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 depicts a proposed biosynthetic pathway for BL. CR goes through at 
least two different pathways, referred to as the early C-6 oxidation (right column) and 
late C-6 oxidation (left column) pathways. Steps mediated by DWF4, CPD 

15 (Szerkeres et al. (1996), supra), DET2 (Fujioka and Skaurai (1997a), infra; Li et al. 
(1997), supra) and LKB (Yokota et al. (1997), infra) are indicated. 

Figures 2A and B depict schematic representations of the DWF4 gene and 
protein. Figure 2 A depicts the DWF4 coding sequence (1542 bp) and shows that the 
coding sequence contains eight exons and seven introns. The exons and introns range 

20 in length from 93 to 604 and 84 to 754 bp, respectively. All of the infrons are 

bordered by typical consensus spHce junctions, 5'-GU and AG-3'. Closed rectangles 
indicate exons. The T-DNA position in dwf4-l is marked with an arrow. Figure 2B 
shows the relative positions of the major domains in DWF4 cytochrome P450. All of 
the major domains found in the cytochrome P450 superfamily are conserved in 

25 DWF4. The estimated molecular mass and isoelectric point of the D WF4 protein 
were 58 kD and 7.28, respectively. Hydropathy plotting and protein locahzation 
prediction by the PSORT software package (Nakai and Kaneshia (1992) Genomics 
14:897-91 1) suggested that the protein may reside in a membrane of the endoplasmic 
reticulum as an integral protein. Mutations identified in the other dwf4 alleles are 

30 indicated. 

Figure 3 depicts alignment of cytochrome P450 proteins that exhibited the 
most similarity to DWF4 in BLAST searches. GenBank accession numbers are 
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AF044216 (DWF4; CYP90B), X87368 (CPD; CYP90A), U54770 (tomato; CYP85), 
D64003 (cyanobacteria; CYP120), U32579 (maize; CYP88), U68234 (zebrafish; 
CYP26), and M13785 (human; CYP3A3X). Dashes indicate gaps introduced to 
maximize alignment. Domains indicated in Figure 2B are highlighted in a box. Amino 
5 acid residues that are conserved >50% between the compared sequences are 

highlighted by a reverse font, and identical residues between DWF4 and CPD are 
boxed and itaUcized. Open triangles are placed under the 100% conserved residues. 
Closed triangles locate functionally important amino acid residues, for example, 
threonine (T ) at 369, which is thought to bind molecular oxygen, and cysteine (C) at 

10 516, which links to a heme prosthetic group by a thiolate bond. X's indicate mutated 
residues in d'wf4 alleles. Multiple sequence alignment was performed using PILEUP 
in the Genetics Computer Group package, and box shading was made possible by the 
ALSCRTPT package (Barton (1993) Protein Eng, 6:37-40). 

Figure 4 depicts the phylogenetic Relationship between DWF4 and Selected 

15 Cytochrome P450s. DWF4 did not cluster with the group A plant cytochrome P450s 
that are known to mediate plant-specific reactions (Durst and Nelson 1995). CYP90A, 
CYP85, and DWF4, which are thought to be involved in BR metaboUsm, branched 
from CYP88, which mediates GA biosynthesis. GenBank accession numbers for the 
group A cytochrome P450s are M32885 (avocado; CYP71A1), P48421 (Arabidopsis; 

20 CYP83), P48418 (petunia; CYP75A1), and X71658 (eggplant; CYP76A1). The 

DISTANCE utility in the Genetics Computer Group software package was employed 
to calculate the relationships. 

Figure 5 depicts a comparison of wild-type and dwf4 hypocotyl growth rates. 
Circles indicate wild-type and square indicate dwf4. Each data point represents the 

25 average of 1 0 seedlings. 

Figure 6 depicts responses to cell elongation signals. BL measixrements were 
performed with dwf4-3 and the corresponding wild-type control, Enkheim. Open bars 
indicate the wild type. Filled bars indicate dwf4. Lines above the bars represent one 
standard deviation. On the horizontal axis, "hght" refers to light-grown controls; 

30 "dark" refers to dark-grown controls; ''hyT refers to DWF4 and dwf4 plants in a hy2 
background; "GA" refers to plants grown in 10"^ M GA; "2,4-D" refers to plants 
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grown in 10^^ M 2,4-D; "-BR" refers to liquid-grown controls; and "+BR" refers to 
liquid-grown controls with 10"^ M BL. 

Figure 7 depicts pedicel elongation of dwf4 mature plants in response to 
exogenous application of BR. Measurements were performed with the BR-fed plants. 
5 dwf4~l plants were more sensitive to intermediates belonging to the late C-6 oxidation 
pathway (10'^ M 6-deoxoCT and 10'^ M 6-deoxoTE) compared with compounds in 
the early C-6 pathway (10"' M CT and 10"^ M TE). BL (10"^ M) induced almost the 
same amount of elongation with one-tenth the concentration of its precursors. Rescue 
by 22-OHCR (10'^ M), which is structurally similar to the presumed precursor CR, 

10 except for a 22a-hydroxyl functional group, shows that the only defect in dwf4 is the 
C-22 hydroxylation reaction. Complementing intermediates and BL induced dramatic 
elongation in the elongating zone of the inflorescence and pedicel, but fertility was 
not increased. Data represent the means ±SE of 15 to 20 pedicels. "CTRL" refers to 
control; " WT" refers to wild type. 

15 Figure 8 depicts the increase in inflorescence growth of three transformants 

which overexpress dwf4 as compared to wild type (Ws-2). The length of 
inflorescences of DWF4 overexpression lines increased more than 20% compared to 
that of wild type. The length of the plant was measured at maturity. Each date point 
is a mean value of more than 9 plants, except AOD4-60 which represents 2 plants. 

20 Figure 9 depicts the increase in seed production of three transformants which 

overexpress dwf4 as compared to wild type (Ws-2). Seeds were harvested from 
individual plants of each genotype (n>5). Seeds from each plant were weighed and a 
mean value calculated. The Figure shows percent increase over wild type. 

Figures 10(A)- 10(G) depict the nucleotide sequence of wild-type dwf4 (SEQ 

25 ID NO: 1 , see, also, GenBank Accession Number AF0442 1 6). The dwf4 

polynucleotide includes a coding region between nucleotides 3203 and 61 10, 
inclusive. The coding region includes the following eight exons: nucleotides 3203 to 
3423, inclusive; nucleotides 3504 to 3828, inclusive; nucleotides 3914 to 4066, 
inclusive; nucleotides 4165 to 4479, inclusive; nucleotides 4632 to 4724, inclusive; 

30 nucleotides 4816 to 4894, inclusive; nucleotides 5001 to 51 10, inclusive and 

nucleotides 5865 to 6110, inclusive. The exons are indicated by a bar beneath the 
nucleotide sequence. A 5' control region {e.g., promoter) extends from nucleotides 1 
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to 3202. A 3' untranslated region (UTR), corresponds to the region extending from 
nucleotide to 6011 to approximately nucleotide 6468 of Figure 10 (SEQ IDN0:1) 
and a TATA signal extending approximately from nucleotides 3060 to 3125. As 
described in the Examples, mutant alleles of dwf4 have also been characterized. For 
example, dwf4-l contains an approximately 20 kb insert between nucleotides 5202 
and 5203. dwf4-2 has a 9 base pair deletion corresponding to amino acids 324-326. 
In mutant allele dwf4-3, the guanine (G) residue at position 4332 is replaced with an 
adenine (A) residue to create a premature stop codon and truncate the DWF4 protein 
at amino acid 289. 

Figure 1 1 depicts the amino acid sequence of the DWF4 polypeptide 
(GenBank Accession Number AAC05093, SEQ ID N0:2). The polypeptide is 5 1 3 
amino acids in length. 

Figure 12 depicts seedling phenotypes of twelve-day-old dwf4-l, wild type, 
epi-BL-treated wild type, eind A0D4 lines grown in the light and dark, particularly 
quantification of hypocotyl and root growth. The average lengths of 16 seedlings are 
displayed with the standard deviation. Increased BR concentration supplied 
exogenously or endogenously resulted in both elongated hypocotyls and shortened 
roots. 

DETAILED DESCRIPTION 
The novel dwf4 polynucleotides and DWF4 polypeptides described herein are 
important molecules in regulating cell growth and sterol synthesis. The present 
inventors have shown that dwf4 encodes a cytochrome P450 monooxygenase having 
43% sequence identity to the protein termed Constitutive Phoromorphogenesis and 
Dwfarism (CPD). As shown in Figure 1, both CPD and DWF4 polypeptides appear 
to regulate biosynthesis of brassinosteriods, for example brassinolide (BL). However, 
unlike previously characterized proteins {e,g, CPD), DWF4 appears to act as a 
"gatekeeper" in these biosynthetic pathways in that its substrates {e.g., 6-Oxo 
campestanol and 6a-Hydroxy campestanol) are approximately 500 times more 
prevalent than the downstream molecules. Thus, the present invention represents an 
important discovery in understanding and regulating cell growth. 
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Before describing the present invention in detail, it is to be understood that 
this invention is not limited to particularly exemphfied molecules or process 
parameters as such may, of course, vary. It is also to be understood that the 
terminology used herein is for the purpose of describing particular embodiments of 
5 the invention only, and is not intended to be limiting. In addition, the practice of the 
present invention will employ, unless otherwise indicated, conventional methods of 
plant biology, virology, microbiology, molecular biology, recombinant DNA 
techniques and immunology all of which are within the ordinary skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Evans, et al, Handbook of 
10 Plant Cell Culture (1983, Macmillan Publishing Co.); Binding, Regeneration of 

Plants, Plant Protoplasts (1985, CRC Press); Sambrook, et al. Molecular Cloning: A 
Laboratory Manual (2nd Edition, 1989); DNA Cloning: A Practical Approach, vol. I 
& II (D. Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); A Practical 
Guide to Molecular Cloning (1984); and Fundamental Virology, 2nd Edition, vol I & 
1 5 II (B.N. Fields and D.M. Knipe, eds.). 

All publications, patents and patent applications cited herein, whether supra or 
infra, are hereby incorporated by reference in their entirety. 

It must be noted that, as used in this specification and the appended claims, the 
singular forms "a", "an" and "the" include plural referents unless the content clearly 
20 dictates otherwise. Thus, for example, reference to "a polypeptide" includes a mixture 
of two or more polypeptides, and the like. 

The following amino acid abbreviations are used throughout the text: 
Alanine: Ala (A) Arginine: Arg (R) 

Asparagine: Asn (N) Aspartic acid: Asp (D) 

25 Cysteine: Cys (C) Glutamine: Gin (Q) 

Glutamic acid: Glu (E) Glycine: Gly (G) 
Histidine: His (H) Isoleucine: He (I) 

Leucine: Leu (L) Lysine: Lys (K) 

Methionine: Met (M) Phenylalanine: Phe (F) 

30 Proline: Pro (P) Serine: Ser (S) 

Threonine: Thr (T) Tryptophan: Trp (W) 

Tyrosine: Tyr (Y) Vahne: Val (V) 
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Definitions 

In describing the present invention, the following terms will be employed, and 
are intended to be defined as indicated below. 

The terms "nucleic acid molecule" and "polynucleotide" are used 
5 interchangeably and refer to a polymeric form of nucleotides of any length, either 
deoxyribonucleotides or ribonucleotides, or analogs thereof This term refers only to 
the primary structure of the molecule and thus includes double- and single-stranded 
DNA and RNA. It also includes known types of modifications, for example, labels 
which are known in the art, methylation, "caps", substitution of one or more of the 

1 0 naturally occurring nucleotides with an analog, intemucleotide modifications such as, 
for example, those with uncharged linkages (e.g., methyl phosphonates, 
phosphotriesters, phosphoamidates, carbamates, etc.) and with charged linkages (e.g., 
phosphorothioates, phosphorodithioates, etc.), those containing pendant moieties, 
such as, for example proteins (including e.g., nucleases, toxins, antibodies, signal 

15 peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), 
those containing chelates (e.g., metals, radioactive metals, boron, oxidative metals, 
etc.), those containing alkylators, those with modified linkages (e.g., alpha anomeric 
nucleic acids, etc.), as well as unmodified forms of the polynucleotide. 
Polynucleotides may have any three-dimensional structure, and may perform any 

20 function, known or unknown. Nonlimiting examples of polynucleotides include a 
gene, a gene fi:agment, exons, introns, messenger RNA (mRNA), transfer RNA, 
ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched 
polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of 
any sequence, nucleic acid probes, and primers. 

25 A polynucleotide is typically composed of a specific sequence of four 

nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T) (uracil (U) 
for thymine (T) when the polynucleotide is RNA). Thus, the term polynucleotide 
sequence is the alphabetical representation of a polynucleotide molecule. This 
alphabetical representation can be input into databases in a computer having a central 

30 processing unit and used for bioinformatics applications such as functional genomics 
and homology searching. 
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Techniques for determining nucleic acid and amino acid "sequence identity" 
are known in the art. Typically, such techniques include determining the nucleotide 
sequence of the mRNA for a gene and/or determining the amino acid sequence 
encoded thereby, and comparing these sequences to a second nucleotide or amino acid 
sequence. In general, "identity" refers to an exact nucleotide-to-nucleotide or amino 
acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, 
respectively. Two or more sequences (polynucleotide or amino acid) can be 
compared by determining their "percent identity." The percent identity of two 
sequences, whether nucleic acid or amino acid sequences, is the number of exact 
matches between two aligned sequences divided by the length of the shorter 
sequences and multiplied by 100. An approximate alignment for nucleic acid 
sequences is provided by the local homology algorithm of Smith and Waterman, 
Advances in ApnUed Mathematics 2:482-489 (1981). This algorithm can be applied 
to amino acid sequences by using the scoring matrix developed by Dayhof£ Atlas of 
Protein Sequences and Structure . M.O. Dayhoff ed., 5 suppl. 3:353-358, National 
Biomedical Research Foundation, Washington, D.C., USA, and normaUzed by 
Gribskov, NucL Acids Res. 14(6):6745-6763 (1986). An exemplary implementation 
of this algorithm to determine percent identity of a sequence is provided by the 
Genetics Computer Group (Madison, WI) in the "BestFit" utility apphcation. The 
default parameters for this method are described in the Wisconsin Sequence Analysis 
Package Program Manual, Version 8 (1995) (available from Genetics Computer 
Group, Madison, WI). A preferred method of establishing percent identity in the 
context of the present invention is to use the MPSRCH package of programs 
copyrighted by the University of Edinburgh, developed by John F, Collins and Shane 
S. Sturrok, and distributed by IntelliGenetics, Inc. (Mountain View, CA). From this 
suite of packages the Smith- Waterman algorithm can be employed where default 
parameters are used for the scoring table (for example, gap open penalty of 12, gap 
extension penalty of one, and a gap of six). From the data generated the "Match" 
value reflects "sequence identity." Other suitable programs for calculating the percent 
identity or similarity between sequences are generally known in the art, for example, 
another alignment program is BLAST, used with default parameters. For example, 
BLASTN and BLASTP can be used using the following default parameters: genetic 
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code = standard; filter = none; strand = both; cutoff = 60; expect = 10; Matrix = 
BLOSUM62; Descriptions = 50 sequences; sort by = HIGH SCORE; Databases = 
non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank CDS translations + 
Swiss protein + Spupdate + PIR. Details of these programs can be found at the 
following internet address: http://www.ncbi.nlni,gov/cgi-bin/BLAST. 

Alternatively, the degree of sequence similarity between polynucleotides can 
be determined by hybridization of polynucleotides under conditions that form stable 
duplexes between homologous regions, followed by digestion with single-stranded- 
specific nuclease(s), and size determination of the digested fragments. Two DNA, or 
two polypeptide sequences are "substantially homologous" to each other when the 
sequences exhibit at least about 43%-60%, preferably 60-70%, more preferably 70%- 
85%, more preferably at least about 85%-90%, more preferably at least about 90%- 
95%, and most preferably at least about 95%-98% sequence identity over a defined 
length of the molecules, or any percentage between the above-specified ranges, as 
determined using the methods above. As used herein, substantially homologous also 
refers to sequences showing complete identity to the specified DNA or polypeptide 
sequence. DNA sequences that are substantially homologous can be identified in a 
Southern hybridization experiment under, for example, stringent conditions, as 
defined for that particular system. Defining appropriate hybridization conditions is 
within the skill of the art. See, e.g., Sambrook et al., supra; DNA Cloning, supra; 
Nucleic Acid Hybridization, supra. 

The degree of sequence identity between two nucleic acid molecules affects 
the efficiency and strength of hybridization events between such molecules. A 
partially identical nucleic acid sequence will at least partially inhibit a completely 
identical sequence from hybridizing to a target molecule. Inhibition of hybridization 
of the completely identical sequence can be assessed using hybridization assays that 
are well known in the art (e.g., Southern blot, Northem blot, solution hybridization, or 
the like, see Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second 
Edition, (1989) Cold Spring Harbor, N.Y.). Such assays can be conducted using 
varying degrees of selectivity, for example, using conditions varying from low to high 
stringency. If conditions of low stringency are employed, the absence of non-specific 
binding can be assessed using a secondary probe that lacks even a partial degree of 
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sequence identity (for example, a probe having less than about 30% sequence identity 
with the target molecule), such that, in the absence of non-specific binding events, the 
secondary probe will not hybridize to the target. 

When utilizing a hybridization-based detection system, a nucleic acid probe is 
chosen that is complementary to a target nucleic acid sequence, and then by selection 
of appropriate conditions the probe and the target sequence "selectively hybridize," or 
bind, to each other to form a hybrid molecule. A nucleic acid molecule that is capable 
of hybridizing selectively to a target sequence under "moderately stringent" typically 
hybridizes xmder conditions that allow detection of a target nucleic acid sequence of at 
least about 10-14 nucleotides in length having at least approximately 70% sequence 
identity with the sequence of the selected nucleic acid probe. Stringent hybridization 
conditions typically allow detection of target nucleic acid sequences of at least about 
10-14 nucleotides in length having a sequence identity of greater than about 90-95% 
with the sequence of the selected nucleic acid probe. Hybridization conditions useful 
for probe/target hybridization where the probe and target have a specific degree of 
sequence identity, can be determined as is known in the art (see, for example. Nucleic 
Acid Hvbridization: A Practical Approach , editors B.D. Hames and S.J. Higgins, 
(1985) Oxford; Washington, DC; IRL Press). 

With respect to stringency conditions for hybridization, it is well known in the 
art that numerous equivalent conditions can be employed to estabUsh a particular 
stringency by varying, for example, the following factors: the length and nature of 
probe and target sequences, base composition of the various sequences, 
concentrations of salts and other hybridization solution components, the presence or 
absence of blocking agents in the hybridization solutions (e.g., formamide, dextran 
sulfate, and polyethylene glycol), hybridization reaction temperature and time 
parameters, as well as, varying wash conditions. The selection of a particular set of 
hybridization conditions is selected following standard methods in the art (see, for 
example, Sambrook, et al., Molecular Cloning: A Laboratorv Manual Second 
Edition, (1989) Cold Spring Harbor, N.Y.). 

A "gene" as used in the context of the present invention is a sequence of 
nucleotides in a genetic nucleic acid (chromosome, plasmid, etc.) with which a 
genetic function is associated. A gene is a hereditary unit, for example of an 
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organism, comprising a polynucleotide sequence that occupies a specific physical 
location (a "gene locus" or "genetic locus") within the genome of an organism. A 
gene can encode an expressed product, such as a polypeptide or a polynucleotide (e.g., 
tRNA). Alternatively, a gene may define a genomic location for a particular 
event/function, such as the binding of proteins and/or nucleic acids, wherein the gene 
does not encode an expressed product. Typically, a gene includes coding sequences, 
such as, polypeptide encoding sequences, and non-coding sequences, such as, 
promoter sequences, polyadenlyation sequences, transcriptional regulatory sequences 
(e.g., enhancer sequences). Many eucaryotic genes have "exons" (coding sequences) 
interrupted by "introns" (non-coding sequences). In certain cases, a gene may share 
sequences with another gene(s) (e.g., overlapping genes). 

A "coding sequence" or a sequence which "encodes" a selected polypeptide, is 
a nucleic acid molecule which is transcribed (in the case of DNA) and translated (in 
the case of mRNA) into a polypeptide, for example, in vivo when placed under the 
control of appropriate regulatory sequences (or "control elements"). The boundaries 
of the coding sequence are typically determined by a start codon at the 5' (amino) 
terminus and a translation stop codon at the 3' (carboxy) terminus. A coding sequence 
can include, but is not limited to, cDNA from viral, procaryotic or eucaryotic mRNA, 
genomic DNA sequences from viral or procaryotic DNA, and even synthetic DNA 
sequences. A transcription termination sequence may be located 3' to the coding 
sequence. Other "control elements" may also be associated with a coding sequence. 
A DNA sequence encoding a polypeptide can be optimized for expression in a 
selected cell by using the codons preferred by the selected cell to represent the DNA 
copy of the desired polypeptide coding sequence. "Encoded by" refers to a nucleic 
acid sequence which codes for a polypeptide sequence, wherein the polypeptide 
sequence or a portion thereof contains an amino acid sequence of at least 3 to 5 amino 
acids, more preferably at least 8 to 10 amino acids, and even more preferably at least 
15 to 20 amino acids from a polypeptide encoded by the nucleic acid sequence. Also 
encompassed are polypeptide sequences which are immunologically identifiable with 
a polypeptide encoded by the sequence. 

Typical "control elements", include, but are not limited to, transcription 
promoters, transcription enhancer elements, transcription termination signals. 
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polyadenylation sequences (located 3' to the translation stop codon), sequences for 
optimization of initiation of translation (located 5' to the coding sequence), translation 
enhancing sequences, and translation termination sequences. Transcription promoters 
can include inducible promoters (where expression of a polynucleotide sequence 
operably linked to the promoter is induced by an analyte, cofactor, regulatory protein, 
etc.), tissue-specific promoters (where expression of a polynucleotide sequence 
operably linked to the promoter is induced only in selected tissue), repressible 
promoters (where expression of a polynucleotide sequence operably linked to the 
promoter is induced by an analyte, cofactor, regulatory protein, etc.), and constitutive 
promoters. 

A control element, such as a promoter, "directs the transcription" of a coding 
sequence in a cell when RNA polymerase will bind the promoter and transcribe the 
coding sequence into mRNA, which is then translated into the polypeptide encoded 
by the coding sequence. 

"Expression enhancing sequences" typically refer to control elements that 
improve transcription or translation of a polynucleotide relative to the expression 
level in the absence of such control elements (for example, promoters, promoter 
enhancers, enhancer elements, and translational enhancers (e.g., Shine and Delagamo 
sequences). 

"Operably linked" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended manner. 
A control sequence "operably linked" to a coding sequence is hgated in such a way 
that expression of the coding sequence is achieved under conditions compatible with 
the control sequences. The control elements need not be contiguous with the coding 
sequence, so long as they function to direct the expression thereof Thus, for 
example, intervening untranslated yet transcribed sequences can be present between a 
promoter and the coding sequence and the promoter can still be considered "operably 
linked" to the coding sequence. 

A "heterologous sequence'* as used herein typically refers to a nucleic acid 
sequence that is not normally found in the cell or organism of interest. For example, a 
DNA sequence encoding a polypeptide can be obtained from a plant cell and 
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introduced into a bacterial cell. In this case the plant DNA sequence is 
"heterologous" to the native DNA of the bacterial cell. 

The "native sequence" or "wild-type sequence" of a gene is the polynucleotide 
sequence that comprises the genetic locus corresponding to the gene, e.g., all 
5 regulatory and open-reading jframe coding sequences required for expression of a 
completely functional gene product as they are present in the wild-type genome of an 
organism. The native sequence of a gene can include, for example, transcriptional 
promoter sequences, translation enhancing sequences, introns, exons, and poly-A 
processing signal sites. It is noted that in the general population, wild-type genes may 

10 include multiple prevalent versions that contain alterations in sequence relative to 
each other and yet do not cause a discemible pathological effect. These variations are 
designated "polymorphisms" or "allehc variations." 

"Recombinant" as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue 

15 of its origin or manipulation: (1) is not associated with all or a portion of the 
polynucleotide with which it is associated in nature; and/or (2) is linked to a 
polynucleotide other than that to which it is linked in nature. The term "recombinant" 
as used with respect to a protein or polypeptide means a polypeptide produced by 
expression of a recombinant polynucleotide. 

20 By "vector" is meant any genetic element, such as a plasmid, phage, 

transposon, cosmid, chromosome, virus etc., which is capable of transferring gene 
sequences to target cells. Generally, a vector is capable of replication when 
associated with the proper control elements. Thus, the term includes cloning and 
expression vehicles, as well as viral vectors and integrating vectors. 

25 As used herein, the term "expression cassette" refers to a molecule comprising 

at least one coding sequence operably linked to a control sequence which includes all 
nucleotide sequences required for the transcription of cloned copies of the coding 
sequence and the translation of the mRNAs in an appropriate host cell. Such 
expression cassettes can be used to express eukaryotic genes in a variety of hosts such 

30 as bacteria, blue-green algae, plant cells, yeast cells, insect cells and animal cells. 
Under the invention, expression cassettes can include, but are not limited to, cloning 
vectors, specifically designed plasmids, viruses or virus particles. The cassettes may 
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further include an origin of replication for autonomous replication in host cells, 
selectable markers, various restriction sites, a potential for high copy number and 
strong promoters. 

A cell has been "transformed" by an exogenous polynucleotide when the 
5 polynucleotide has been introduced inside the cell. The exogenous polynucleotide 
may or may not be integrated (covalently linked) into chromosomal DNA making up 
the genome of the cell. In prokaryotes and yeasts, for example, the exogenous DNA 
may be maintained on an episomal element, such as a plasmid. With respect to 
eucaryotic cells, a stably transformed cell is one in which the exogenous DNA has 

10 become integrated into the chromosome so that it is inherited by daughter cells 

through chromosome replication. This stability is demonstrated by the ability of the 
eucaryotic cell to establish cell lines or clones comprised of a population of daughter 
cells containing the exogenous DNA. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and 

1 5 other such terms denoting procaryotic microorganisms or eucaryotic cell hues 

cultured as unicellular entities, are used interchangeably, and refer to cells which can 
be, or have been, used as recipients for recombinant vectors or other transfer DNA, 
and include the progeny of the original cell which has been transfected. It is 
understood that the progeny of a single parental cell may not necessarily be 

20 completely identical in morphology or in genomic or total DNA complement to the 
original parent, due to accidental or deliberate mutation. Progeny of the parental cell 
which are sufficiently similar to the parent to be characterized by the relevant 
property, such as the presence of a nucleotide sequence encoding a desired peptide, 
are included in the progeny intended by this definition, and are covered by the above 

25 terms. 

The term ''dwf4 polynucleotide" refers to a polynucleotide derived fi'om the 
dwf4 gene. The gene encodes the protein referred to herein as DWF4. DWF4 is a 
cytochrome P450 cytochrome P450 that mediates multiple 22a-hydroxylation steps in 
brassinosteroid biosynthesis (see. Figure 1). The dwf4 polynucleotide sequence and 
30 corresponding amino acid sequence are shown in Figures 1 0 and 1 1 (SEQ ID NO: 1 , 
SEQ ID NO:2 and GenBank accession No. AF044216). As shown in Figure 10, the 
dwf4 coding sequence spans the region from nucleotide positions 3203 to 61 10 and 
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the upstream 5' UTR, including the promoter region, spans nucleotide positions 1 to 
3202. A functional LI kb control element is also described in the Examples. A 3* 
UTR spans nucleotide positions 61 1 1 to approximately 6468 of SEQ ID NO: 1 . The 
term as used herein encompasses a polynucleotide including a native sequence 

5 depicted in Figure 10, as well as modifications and fragments thereof. 

The term encompasses alterations to the polynucleotide sequence, so long as 
the alteration results in a plant displaying one or more dwf4 phenotypic traits 
(described below) when the polynucleotide is expressed in a plant. Such 
modifications typically include deletions, additions and substitutions, to the native 

10 dwf4 sequence, so long as the mutation results in a plant displaying a dwf4 phenotype 
as defined below. These modifications may be deliberate, as through site-directed 
mutagenesis, or may be accidental, such as through mutations of plants which express 
the dwf4 polynucleotide or errors due to PGR ampHfication. The term encompasses 
expressed allehc variants of the wild-type dwf4 sequence which may occur by normal 

1 5 genetic variation or are produced by genetic engineering methods and which result in 
a detectable change in the wild-type dwf4 phenotype. 

The term "dwf4 phenotype" as used herein refers to any microscopic or 
macroscopic change in structure or morphology of a plant, such as a transgenic plant, 
as well as biochemical differences, which are characteristic of a dwf4 plant, compared 

20 to a progenitor, wild-type plant cultivated under the same conditions. Generally, 

morphological differences include multiple short stems, short roxmded leaves, loss of 
fertility due to reduced stamen length, and delayed development. Dark-grown dwf4 
seedlings possess short hypocotyls, open cotyledons, and developing leaves. The 
height of such plants will typically be 75% or less of the wild-type plant, more 

25 typically 50% or less of the wild-type plant, and even more typically 25%o or less of 
the wild-type plant, or any integer in between. Additional phenotypic morphological 
attributes of the dwf4 mutant are summarized in Table 1 of the examples. 
Biochemically, dwf4 hypocotyls are converted to wild-type length with the 
application of BL. 

30 A "polypeptide" is used in it broadest sense to refer to a compound of two or 

more subimit amino acids, amino acid analogs, or other peptidomimetics. The 
subunits may be linked by peptide bonds or by other bonds, for example ester, ether, 
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etc. As used herein, the term "amino acid" refers to either natural and/or unnatural or 
synthetic amino acids, including glycine and both the D or L optical isomers, and 
amino acid analogs and peptidomimetics. A peptide of three or more amino acids is 
commonly called an oUgopeptide if the peptide chain is short. If the peptide chain is 
5 long, the peptide is typically called a polypeptide or a protein. Full-length proteins, 
analogs, mutants and fragments thereof are encompassed by the definition. The terms 
also include postexpression modifications of the polypeptide, for example, 
glycosylation, acetylation, phosphorylation and the Uke. Furthermore, as ionizable 
amino and carboxyl groups are present in the molecule, a particular polypeptide may 

10 be obtained as an acidic or basic salt, or in neutral form. A polypeptide may be 

obtained directly from the source organism, or may be recombinantly or synthetically 
produced (see fiirther below). 

A "DWF4" polypeptide is a polypeptide as defined above, which is derived 
from a 22a-hydroxylase that fimctions in the brassinolide (BL) biosynthetic pathway 

1 5 (see, Figure 1). The native sequence of fiiU-length DWF4 is shown in Figure 1 1 

(SEQ ID N0:2). However, the term encompasses mutants and fragments of the native 
sequence so long as the protein ftmctions for its intended purpose. 

The term "DWF4 analog" refers to derivatives of DWF4, or fragments of such 
derivatives, that retain desired fimction, e.g., as measured in assays as described 

20 fiirther below. In general, the term "analog" refers to compounds having a native 
polypeptide sequence and structure with one or more amino acid additions, 
substitutions (generally conservative in nature) and/or deletions, relative to the native 
molecule, so long as the modifications do not destroy desired activity. Preferably, 
the analog has at least the same activity as the native molecule. Methods for making 

25 polypeptide analogs are known in the art and are described fiirther below. 

Particularly preferred analogs include substitutions that are conservative in 
nature, i.e., those substitutions that take place within a family of amino acids that are 
related in their side chains. Specifically, amino acids are generally divided into four 
famihes: (1) acidic - aspartate and glutamate; (2) basic ~ lysine, arginine, histidine; 

30 (3) non-polar - alanine, valine, leucine, isoleucine, proline, phenylalanine, 

methionine, tryptophan; and (4) uncharged polar ~ glycine, asparagine, glutamine, 
cysteine, serine threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are 
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sometimes classified as aromatic amino acids. For example, it is reasonably 
predictable that an isolated replacement of leucine with isoleucine or valine, an 
aspartate with a glutamate, a threonine with a serine, or a similar conservative 
replacement of an amino acid with a structurally related amino acid, will not have a 
5 major effect on the biological activity. It is to be understood that the terms include 
the various sequence polymorphisms that exist, wherein amino acid substitutions in 
the protein sequence do not affect the essential functions of the protein. 

By "purified" and "isolated" is meant, when referring to a polypeptide or 
polynucleotide, that the molecule is separate and discrete from the whole organism 

10 with which the molecule is found in nature; or devoid, in whole or part, of sequences 
normally associated with it in nature; or a sequence, as it exists in nature, but having 
heterologous sequences (as defined below) in association therewith. It is to be 
understood that the term "isolated" with reference to a polynucleotide intends that the 
polynucleotide is separate and discrete from the chromosome from which the 

15 polynucleotide may derive. The term "purified" as used herein preferably means at 
least 75% by weight, more preferably at least 85% by weight, more preferably still at 
least 95% by weight, and most preferably at least 98% by weight, of biological 
macromolecules of the same type are present. An "isolated polynucleotide which 
encodes a particular polypeptide" refers to a nucleic acid molecule which is 

20 substantially free of other nucleic acid molecules that do not encode the subject 
polypeptide; however, the molecule may include some additional bases or moieties 
which do not deleteriously affect the basic characteristics of the composition. 

By "fragment" is intended a polypeptide or polynucleotide consisting of only a 
part of the intact sequence and structure of the reference polypeptide or 

25 polynucleotide, respectively. The fragment can include a 3' or C-terminal deletion or 
a 5' or N-terminal deletion, or even an internal deletion, of the native molecule. A 
polynucleotide fragment of a dwf4 sequence will generally include at least about 15 
contiguous bases of the molecule in question, more preferably 18-25 contiguous 
bases, even more preferably 30-50 or more contiguous bases of the dwf4 molecule, or 

30 any integer between 15 bases and the full-length sequence of the molecule. 

Fragments which provide at least one dwf4 phenotype as defined above are useful in 
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the production of transgenic plants. Fragments are also useful as oligonucleotide 
probes, to find additional dwf4 sequences. 

Similarly, a polypeptide fragment of a DWF4 molecule will generally include 
at least about 5-10 contiguous amino acid residues of the full-length molecule, 
5 preferably at least about 15-25 contiguous amino acid residues of the full-length 
molecule, and most preferably at least about 20-50 or more contiguous amino acid 
residues of the full-length DWF4 molecule, or any integer between 10 amino acids 
and the full-length sequence of the molecule. Such fragments are useful for the 
production of antibodies and the like. 

10 By "transgenic plant" is meant a plant into which one or more exogenous 

polynucleotides have been introduced. Examples of means by which this can be 
accomplished are described below, and include Agrobacterium-mediated 
transformation, bioUstic methods, electroporation, and the hke. In the context of the 
present invention, the transgenic plant contains a polynucleotide which is not 

15 normally present in the corresponding wild-type plant and which confers at least one 
dwf4 phenotypic trait to the plant. The transgenic plant therefore exhibits altered 
structure, morphology or biochemistry as compared with a progenitor plant which 
does not contain the transgene, when the transgenic plant and the progenitor plant are 
cultivated under similar or equivalent growth conditions. Such a plant containing the 

20 exogenous polynucleotide is referred to here as an generation transgenic plant. 
Transgenic plants may also arise from sexual cross or by selfmg of transgenic plants 
into which exogenous polynucleotides have been introduced. Such a plant containing 
the exogenous nucleic acid is also referred to here as an generation transgenic 
plant. Transgenic plants which arise from a sexual cross with another parent line or by 

25 selfing are "descendants or the progeny" of a Rj plant and are generally called F„ 
plants or S^ plants, respectively, n meaning the number of generations. 



General Overview 

In this report, we present morphological, biochemical, and molecular analysis 
30 of a novel gene, dwf4, isolated from Arabidopsis. Morphologically, dwf4 plants 

display a dramatic reduction in the length of many different organs examined, and this 
size reduction is attributable to a defect in cell elongation. Biochemically, dwf4 
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hypocotyls were converted completely to wild-type length with the appUcation of BL, 
suggesting a deficiency in BRs. In agreement with this, BR intermediate feeding 
analysis, indicated that dwf4 encodes a cytochrome P450 that mediates multiple 22a- 
hydroxylation steps in brassinosteriod biosynthesis. Sequencing of the dwf4 locus 
5 and analysis of the protein product are described. 

The molecules of the present invention are therefore useful in the production 
of transgenic plants which display at least one dwf4 phenotype, so that the resulting 
plants have altered structure or morphology. The present invention particularly 
provides for altered structure or morphology such as reduced cell length, extended 

10 flowering periods, increased size of leaves or fruit, increased branching, increased 
seed production and altered sterol composition relative wild-type plants. The DWF4 
polypeptides can be expressed to engineer a plant with desirable properties. The 
engineering is accomplished by transforming plants with nucleic acid constructs 
described herein which may also comprise promoters and secretion signal peptides. 

15 The transformed plants or their progenies are screened for plants that express the 
desired polypeptide. 

Engineered plants exhibiting the desired altered structure or morphology can 
be used in plant breeding or directly in agricultural production or industrial 
appUcations. Plants having the altered polypeptide can be crossed with other altered 

20 plants engineered with alterations in other growth modulation enzymes, proteins or 
polypeptides to produce lines with even further enhanced altered structural 
morphology characteristics compared to the parents or progenitor plants. 



Isolation of Nucleic Acid Sequences fi-om Plants 

25 The isolation of dwf4 sequences from the polynucleotides of the invention 

may be accomplished by a number of techniques. For instance, oUgonucleotide 
probes based on the sequences disclosed here can be used to identify the desired gene 
in a cDNA or genomic DNA library from a desired plant species. To construct 
genomic libraries, large segments of genomic DNA are generated by random 

30 fragmentation, e.g. using restriction endonucleases, and are ligated with vector DNA 
to form concatemers that can be packaged into the appropriate vector. To prepare a 
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library of tissue-specific cDNAs, mRNA is isolated from tissues and a cDNA library 
which contains the gene transcripts is prepared from the mRNA. 

The cDNA or genomic library can then be screened using a probe based upon 
the sequence of a cloned gene such as the polynucleotides disclosed here. Probes may 
be used to hybridize with genomic DNA or cDNA sequences to isolate homologous 
genes in the same or different plant species. Alternatively, the nucleic acids of 
interest can be amphfied from nucleic acid samples using amplification techniques. 
For instance, polymerase chain reaction (PGR) technology to ampUfy the sequences 
of the genes directly from mRNA, from cDNA, from genomic libraries or cDNA 
libraries. PCR.RTM. and other in vitro amplification methods may also be usefiil, for 
example, to clone nucleic acid sequences that code for proteins to be expressed, to 
make nucleic acids to use as probes for detecting the presence of the desired mRNA in 
samples, for nucleic acid sequencing, or for other purposes. 

Appropriate primers and probes for identifyuig <fw/^-specific genes from plant 
tissues are generated from comparisons of the sequences provided herein. For a 
general overview of PGR see Innis et al. eds, PCR Protocols: A Guide to Methods and 
Applications, Academic Press, San Diego (1990). Appropriate primers for this 
invention include, for instance, those primers described in the Examples and Sequence 
Listings, as well as other primers derived from the dwf4 sequences disclosed herein. 
Suitable amphfications conditions may be readily determined by one of skill in the art 
in view of the teachings herein, for example, including reaction components and 
amplification conditions as follows: 10 mM Tris-HCl, pH 8.3, 50 mM potassium 
chloride, 1.5 mM magnesium chloride, 0.001% gelatin, 200 ^M dATP, 200 jiM 
dCTP, 200 dGTP, 200 dTTP, 0.4 primers, and 100 units per mL Taq 
polymerase; 96°C for 3 min., 30 cycles of 96°G for 45 seconds, 50°G for 60 seconds, 
72°G for 60 seconds, followed by 72°G for 5 min. 

Polynucleotides may also be synthesized by well-known techniques as 
described in the technical Uterature. See, e.g., Garrutiiers, et al. (1982) Cold Spring 
Harbor Symp. Quant. Biol. 47:411-418, and Adams, etal. (1983) J. Am. Chem. Sac. 
1 05 :661 . Double sfranded DNA fragments may then be obtained either by 
synthesizing the complementary sfrand and annealing the strands together under 
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appropriate conditions, or by adding the complementary strand using DNA 
polymerase with an appropriate primer sequence. 

The polynucleotides of the present invention may also be used to isolate or 
create other mutant cell gene alleles. Mutagenesis consists primarily of site-directed 
5 mutagenesis followed by phenotypic testing of the altered gene product. Some of the 
more commonly employed site-directed mutagenesis protocols take advantage of 
vectors that can provide single stranded as well as double stranded DNA, as needed. 
Generally, the mutagenesis protocol with such vectors is as follows. A mutagenic 
primer, i.e., a primer complementary to the sequence to be changed, but consisting of 

10 one or a small number of altered, added, or deleted bases, is synthesized. The primer 
is extended in vitro by a DNA polymerase and, after some additional manipulations, 
the now double-stranded DNA is transfected into bacterial cells. Next, by a variety of 
methods, the desired mutated DNA is identified, and the desired protein is purified 
from clones containing the mutated sequence. For longer sequences, additional 

15 cloning steps are often required because long inserts (longer than 2 kilobases) are 
imstable in those vectors. Protocols are known to one skilled in the art and kits for 
site-directed mutagenesis are widely available from biotechnology supply companies, 
for example from Amersham Life Science, Inc. (ArUngton Heights, 111.) and 
Stratagene Cloning Systems (La Jolla, Calif). 

20 

Control elements 

Regulatory regions can be isolated from the dwf4 gene and used in 
recombinant constructs for modulating the expression of the dwf4 gene or a 
heterologous gene in vitro and/or in vivo. As shown in Figure 10, the coding region 
25 of the dwf4 gene (designated by the Ught grey bar) begins at nucleotide position 1 133. 
The region of the gene spanning nucleotide positions 990-1132 of Figure 10 includes 
the dwf4 promoter. This region may be used in its entirety or fragments of the region 
may be isolated which provide the ability to direct expression of a coding sequence 
linked thereto, 

30 Thus, promoters can be identified by analyzing the 5' sequences of a genomic 

clone corresponding to the (iw/Z-specific genes described here. Sequences 
characteristic of promoter sequences can be used to identify the promoter. Sequences 
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controlling eukaryotic gene expression have been extensively studied. For instance, 
promoter sequence elements include the TATA box consensus sequence (TATAAT), 
which is usually 20 to 30 base pairs upstream of the transcription start site. In most 
instances the TATA box is required for accurate transcription initiation. In plants, 
5 further upstream from the TATA box, at positions -80 to -1 00, there is typically a 
promoter element with a series of adenines surrounding the trinucleotide G (or T) N 
G. (See, J. Messing et al., in Genetic Engineering in Plants, pp. 221-227 (Kosage, 
Meredith and HoUaender, eds. (1983)). Methods for identifying and characterizing 
promoter regions in plant genomic DNA are described, for example, in Jordano et al. 

10 (1989) Plant Cell 1:855-866; Bustos et al (1989) Plant Cell 1:839-854; Green et al. 
(1988) EMBO 1 7:4035-4044; Meier et al. (1991) Plant Cell 3:309-316; and Zhang et 
al (1996) Plant Physiology 110:1069-1079). 

Additionally, the promoter region may include nucleotide substitutions, 
insertions or deletions that do not substantially affect the binding of relevant DNA 

1 5 binding proteins and hence the promoter function. It may, at times, be desirable to 
decrease the binding of relevant DNA binding proteins to "silence" or "down- 
regulate" a promoter, or conversely to increase the binding of relevant DNA binding 
proteins to "enhance" or "up-regulate" a promoter. In such instances, the nucleotide 
sequence of the promoter region may be modified by, e.g., inserting additional 

20 nucleotides, changing the identity of relevant nucleotides, including use of 
chemically-modified bases, or by deleting one or more nucleotides. 

Promoter function can be assayed by methods known in the art, preferably by 
measuring activity of a reporter gene operatively linked to the sequence being tested 
for promoter function. Examples of reporter genes include those encoding luciferase, 

25 green fluorescent protein, GUS, neo, cat and bar. 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon 
junctions are also within the scope of the invention. UTR sequences include introns 
and 5' or 3' untranslated regions ( 5' UTRs or 3' UTRs). As shown in Figures 2 and 
10, the dwf4 gene sequence includes eight exons and seven introns. These portions of 

30 the dwf4 gene especially UTRs, can have regulatory functions related to, for example, 
translation rate and mRNA stability. Thus, these portions of the gene can be isolated 
for use as elements of gene constructs for expression of polynucleotides encoding 
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desired polypeptides. The 5' control element region of dwf 4 extends from nucleotides 
1 through 3202 of SEQ ID NO:L Further, as described in Example 11, a 1.1 kb 
portion of this region that is directly upstream of the translation initiation site contains 
elements necessary for transcriptional control of dwf4. In contrast, a 280 bp fragment 
5 of the dwf4 control element region that includes the TATA-Hke region does not 
appear to contain all of the necessary transcriptional control elements (see, Example 
11). 

Introns of genomic DNA segments may also have regulatory ftmctions. 
Sometimes promoter elements, especially transcription enhancer or suppressor 

10 elements, are found within introns. Also, elements related to stabihty of 

heteronuclear RNA and efficiency of transport to the cytoplasm for translation can be 
found in intron elements. Thus, these segments can also find use as elements of 
expression vectors intended for use to transform plants. 

The introns, UTR sequences and intron/exon junctions can vary from the 

1 5 native sequence. Such changes from those sequences preferably will not affect the 
regulatory activity of the UTRs or intron or intron/exon junction sequences on 
expression, transcription, or translation. However, in some instances, down- 
regulation of such activity may be desired to modulate traits or phenotypic or in vitro 
activity. 

20 

Use of Nucleic Acids of the Invention to Inhibit Gene Expression 

The isolated sequences prepared as described herein, can be used to prepare 
expression cassettes useftil in a number of techniques. For example, expression 
cassettes of the invention can be used to suppress (underexpress) endogenous dwf4 

25 gene expression. Inhibiting expression can be useful, for instance, in suppressing the 
phenotype (e.g., dwarf appearance, 22a-hydroxylase activity) exhibited by dwf4 
plants. Further, the inhibitory polynucleotides of the present invention can also be 
used in combination with overexpressing constructs described below, for example, 
using suitable tissue-specific promoters linked to polynucleotides described herein. In 

30 this way, the polynucleotides can be used to promote dwf4 phenotypes {e,g., activity) 
in selected tissue and, at the same time, inhibit dwf4 phenotypes (e.g., activity) in 
different tissue(s). 
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A niunber of methods can be used to inhibit gene expression in plants. For 
instance, antisense technology can be conveniently used. To accomplish this, a 
nucleic acid segment from the desired gene is cloned and operably linked to a 
promoter such that the antisense strand of RNA will be transcribed. The expression 
5 cassette is then transformed into plants and the antisense strand of RNA is produced. 
In plant cells, it has been suggested that antisense RNA inhibits gene expression by 
preventing the accumulation of mRNA which encodes the enzyme of interest, see, 
e.g., Sheehy et al (1988) Proc. Nat Acad, Sci USA 85:8805-8809, and Hiatt et al., 
U.S. Patent Number 4,801,340. 

1 0 The nucleic acid segment to be introduced generally will be substantially 

identical to at least a portion of the endogenous gene or genes to be repressed. The 
sequence, however, need not be perfectly identical to inhibit expression. The vectors 
of the present invention can be designed such that the inhibitory effect apphes to other 
proteins within a family of genes exhibiting homology or substantial homology to the 

15 target gene. 

For antisense suppression, the introduced sequence also need not be full length 
relative to either the primary transcription product or fully processed mRNA. 
Generally, higher homology can be used to compensate for the use of a shorter 
sequence. Furthermore, the introduced sequence need not have the same intron or 

20 exon pattem, and homology of non-coding segments may be equally effective. 
Normally, a sequence of between about 30 or 40 nucleotides and about full length 
nucleotides should be used, though a sequence of at least about 100 nucleotides is 
preferred, a sequence of at least about 200 nucleotides is more preferred, and a 
sequence of at least about 500 nucleotides is especially preferred. It is to be 

25 understood that any integer between the above-recited ranges is intended to be 
captured herein. 

Catalytic RNA molecules or ribozymes can also be used to inhibit expression 
of dwf4 genes. It is possible to design ribozymes that specifically pair with virtually 
any target RNA and cleave the phosphodiester backbone at a specific location, 
30 thereby functionally inactivating the target RNA, In carrying out this cleavage, the 
ribozyme is not itself altered, and is thus capable of recycling and cleaving other 
molecules, making it a true enzyme. The inclusion of ribozyme sequences within 
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antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 
activity of the constructs. 

A number of classes of ribozymes have been identified. One class of 
ribozymes is derived from a number of small circular RNAs which are capable of 
5 self-cleavage and replication in plants. The RNAs rephcate either alone (viroid 
RNAs) or with a helper virus (satellite RNAs). Examples include RNAs from 
avocado sunblotch viroid and the satellite RNAs from tobacco ringspot virus, lucerne 
transient streak virus, velvet tobacco mottle virus, solanum nodiflorum mottle virus 
and subterranean clover mottle virus. The design and use of target RNA-specific 
10 ribozymes is described in Haseloff et al (1988) Nature 334:585-591. 

Another method of suppression is sense suppression. Introduction of 
expression cassettes in which a nucleic acid is configured in the sense orientation with 
respect to the promoter has been shown to be an effective means by which to block 
the transcription of target genes. For an example of the use of this method to 
1 5 modulate expression of endogenous genes see, Napoli et al ( 1 990) The Plant Cell 
2:279-289 and U.S. Patent Numbers 5,034,323, 5,231,020, and 5,283,184. 

Generally, where inhibition of expression is desired, some franscription of the 
introduced sequence occurs. The effect may occur where the infroduced sequence 
contains no coding sequence per se, but only intron or untranslated sequences 
homologous to sequences present in the primary franscript of the endogenous 
sequence. The introduced sequence generally will be substantially identical to the 
endogenous sequence intended to be repressed. This minimal identity will typically 
be greater than about 50%-65%, but a higher identity might exert a more effective 
repression of expression of the endogenous sequences. Substantially greater identity 
of more than about 80% is preferred, though about 95% to absolute identity would be 
most preferred. It is to be understood that any integer between the above-recited 
ranges is intended to be captured herein. As with antisense regulation, the effect 
should apply to any other proteins within a similar family of genes exhibiting 
homology or substantial homology. 

For sense suppression, the introduced sequence in the expression cassette, 
needing less than absolute identity, also need not be frill length, relative to either the 
primary transcription product or frilly processed mRNA. This may be preferred to 



34 



2225-0001 
PATENT 

avoid concurrent production of some plants which are overexpressers. A higher 
identity in a shorter than full length sequence compensates for a longer, less identical 
sequence. Furthermore, the introduced sequence need not have the same intron or 
exon pattern, and identity of non-coding segments will be equally effective. 
Normally, a sequence of the size ranges noted above for antisense regulation is used. 

Use of Nucleic Acids of the Invention to Enhance Gene Expression 

In addition to inhibiting certain features of a plant, the polynucleotides of the 
invention can be used to increase certain features such as extending flowering, 
producing larger leaves or fruit, producing increased branching and increasing seed 
production. This can be accomphshed by the overexpression of dwf4 polynucleotides. 

The exogenous dwf4 polynucleotides do not have to code for exact copies of 
the endogenous dwf4 proteins. Modified DWF4 protein chains can also be readily 
designed utilizing various recombinant DNA techniques well known to those skilled 
in the art and described for instance, in Sambrook et al., supra. Hydroxylamine can 
also be used to introduce single base mutations into the coding region of the gene 
(Sikorski et al (1991) Meth. Enzymol. 194: 302-318). For example, the chains can 
vary from the naturally occurring sequence at the primary structure level by amino 
acid substitutions, additions, deletions, and the like. These modifications can be used 
in a number of combinations to produce the final modified protein chain. 

It will be apparent that the polynucleotides described herein can be used in a 
variety of combinations. For example, the polynucleotides can be used to produce 
different phenotypes in the same organism, for instance by using tissue-specific 
promoters to overexpress a dwf4 polynucleotide in certain tissues {e.g., leaf tissue) 
while at the same time using tissue-specific promoters to inhibit expression of dwf4 in 
other tissues. In addition, fusion proteins of the polynucleotides described herein with 
other known polynucleotides (e.g., polynucleotides encoding products involved in the 
BR pathway) can be constructed and employed to obtain desired phenotypes. 

Any of the dwf4 polynucleotides described herein can also be used in standard 
diagnostic assays, for example, in assays mRNA levels (see, Sambrook et al, supra); 
as hybridization probes, e.g., in combination with appropriate means, such as a label. 
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for detecting hybridization (see, Sambrook et al, supra); as primers, e.g., for PGR 
(see, Sambrook et al., supra); attached to solid phase supports and the hke. 

Preparation of Recombinant Vectors 
5 To use isolated sequences in the above techniques, recombinant DNA vectors 

suitable for transformation of plant cells are prepared. Techniques for transforming a 
wide variety of higher plant species are well known and described further below as 
well as in the technical and scientific literature. See, for example, Weising et al (1988) 
Ann. Rev. Genet. 22:421-477. A DNA sequence coding for the desired polypeptide, 

10 for example a cDNA sequence encoding the full length DWF4 protein, will preferably 
be combined with transcriptional and translational initiation regulatory sequences 
which will direct the transcription of the sequence from the gene in the intended 
tissues of the transgenic plant. 

Such regulatory elements include but are not limited to the promoters derived 

15 from the genome of plant cells (e.g., heat shock promoters such as soybean hspl7.5-E 
or hspl7.3-B (Gurley et al. (1986) MoL Cell. Biol. 6:559-565); the promoter for the 
small subunit of RUBISCO (Coruzzi et al. (1984) EMBOJ, 3:1671-1680; Broglie et 
al (1984) Science 224:838-843); the promoter for the chlorophyll a^ binding protein) 
or from plant viruses viral promoters such as the 35S RNA and 19S RNA promoters 

20 of CaMV (Brisson et al (1984) Nature 3\0\5\\'5\A\ or the coat protein promoter of 
TMV (Takamatsu et al. (1987) EMBO J. 6:307-31 1), cytomegalovirus hCMV 
immediate early gene, the early or late promoters of SV40 adenovirus, the lac system, 
the trp system, the TAG system, the TRG system, the major operator and promoter 
regions of phage A, the control regions of fd coat protein, the promoter for 

25 3-phosphoglycerate kinase, the promoters of acid phosphatase, heat shock promoters 
(e.g., as described above) and the promoters of the yeast alpha-mating factors. 

In construction of recombinant expression cassettes of the invention, a plant 
promoter fragment may be employed which will direct expression of the gene in all 
tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" 

30 promoters and are active under most environmental conditions and states of 

development or cell differentiation. Examples of constitutive promoters include the 
cauhflower mosaic virus (GaMV) 35S transcription initiation region, the T-DNA 
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mannopine synthetase promoter (e.g., the T- or 2'- promoter derived from T-DNA of 
Agrobacterium tumafaciens), and other transcription initiation regions from various 
plant genes known to those of skill. 

Alternatively, the plant promoter may direct expression of the polynucleotide 

5 of the invention in a specific tissue (tissue-specific promoters) or may be otherwise 
under more precise environmental control (inducible promoters). Examples of 
tissue-specific promoters under developmental control include promoters that initiate 
transcription only in certain tissues, such as fruit, seeds, or flowers such as tissue- or 
developmental-specific promoter, such as, but not limited to the dwf4 promoter, the 

10 CHS promoter, the PATATIN promoter, etc. The tissue specific E8 promoter from 
tomato is particularly usefiil for directing gene expression so that a desired gene 
product is located in fruits. 

Other suitable promoters include those from genes encoding embryonic 
storage proteins. Examples of environmental conditions that may affect transcription 

15 by inducible promoters include anaerobic conditions, elevated temperature, or the 
presence of light. If proper polypeptide expression is desired, a polyadenylation 
region at the 3 '-end of the coding region should be included. The polyadenylation 
region can be derived from the natural gene, from a variety of other plant genes, or 
from T-DNA. In addition, the promoter itself can be derived from the dwf4 gene, as 

20 described above. 

The vector comprising the sequences (e.g., promoters or coding regions) from 
genes of the invention will typically comprise a marker gene which confers a 
selectable phenotype on plant cells. For example, the marker may encode biocide 
resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, 

25 bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosluforon 
or Basta. 

Production of Transgenic Plants 

DNA constructs of the invention may be introduced into the genome of the 
30 desired plant host by a variety of conventional techniques. For reviews of such 

techniques see, for example, Weissbach & Weissbach Methods for Plant Molecular 
Biology (1988, Academic Press, N.Y.) Section VIII, pp. 421-463; and Grierson & 
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Corey, Plant Molecular Biology (1988, 2d Ed.), Blackie, London, Ch. 7-9. For 
example, the DNA construct may be introduced directly into the genomic DNA of the 
plant cell using techniques such as electroporation and microinjection of plant cell 
protoplasts, or the DNA constructs can be introduced directly to plant tissue using 
5 biolistic methods, such as DNA particle bombardment (see, e.g., Klein et al (1987) 
Nature 327:70-73). Altematively, the DNA constructs may be combined with 
suitable T-DNA flanking regions and introduced into a conventional Agrobacterium 
txmiefaciens host vector, Agrobacterium tumefaciens-mediated transformation 
techniques, including disarming and use of binary vectors, are vv^ell described in the 

10 scientific literature. See, for example Horsch et al (1984) Science 233:496-498, and 
Fraley et al (1983) Proc. Natl Acad. Set USA 80:4803, The virulence functions of 
the Agrobacterium tumefaciens host will direct the insertion of the construct and 
adjacent marker into the plant cell DNA when the cell is infected by the bacteria using 
binary T DNA vector (Bevan (1984) Nuc. Acid Res, 12:871 1-8721) or the 

15 co-cultivation procedure (Horsch et al (1985) Science 227:1229-1231). Generally, the 
Agrobacterium transformation system is used to engineer dicotyledonous plants 
(Bevan et al (1982) Ann. Rev, Genet 16:357-384; Rogers et al (1986) Methods 
EnzymoL 1 18:627-641). The Agrobacterium transformation system may also be used 
to transform, as well as transfer, DNA to monocotyledonous plants and plant cells. 

20 (see Hemalsteen et al (1984) EMBO J 3:3039-3041; Hooykass-Van Slogteren et al 

(1984) Nature 311:763-764; Grimsley et al (1987) Nature 325:1677-179; Boulton et 
al (1989) Plant MoL Biol. 12:31-40.; and Gould et al (1991) Plant Physiol 
95:426-434). 

Alternative gene transfer and transformation methods include, but are not 
25 Hmited to, protoplast transformation through calcium-, polyethylene glycol (PEG)- or 
electroporation-mediated uptake of naked DNA (see Paszkowski et al. (1984) EMBO 
J3:2717-2722, Potrykus et al. (1985) Molec. Gen. Genet. 199:169-177; Fromm et al. 

(1985) Proc. Nat. Acad. Sci. USA 82:5824-5828; and Shimamoto (1989) Nature 
338:274-276) and electroporation of plant tissues (D'Halluin et al. (1992) Plant Cell 

30 4:1495-1505). Additional methods for plant cell transformation include 

microinjection, sihcon carbide mediated DNA uptake (Kaeppler et al, (1990) Plant 
Cell Reporter 9:415-418), and microprojectile bombardment (see Klein et al. (1988) 
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Proc. Nat. Acad. Set USA 85:4305-4309; and Gordon-Kamm et al. (1990) Plant Cell 
2:603-618). 

Transformed plant cells which are produced by any of the above 
transformation techniques can be cultured to regenerate a whole plant which possesses 
5 the transformed genotype and thus the desired phenotype. Such regeneration 

techniques rely on manipulation of certain phytohormones in a tissue culture growth 
medium, typically relying on a biocide and/or herbicide marker which has been 
introduced together with the desired nucleotide sequences. Plant regeneration from 
cultured protoplasts is described in Evans, et al., "Protoplasts Isolation and Culture" in 

10 Handbook of Plant Cell Culture, pp. 124-176, MacmiUian PubUshing Company, New 
York, 1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC 
Press, Boca Raton, 1985. Regeneration can also be obtained from plant callus, 
explants, organs, pollens, embryos or parts thereof Such regeneration techniques are 
described generally in Klee et al (1987) Ann. Rev. of Plant Phys. 38:467-486. 

1 5 The nucleic acids of the invention can be used to confer desired traits on 

essentially any plant. A wide variety of plants and plant cell systems may be 
engineered for the desired physiological and agronomic characteristics described 
herein using the nucleic acid constructs of the present invention and the various 
transformation methods mentioned above. In preferred embodiments, target plants 

20 and plant cells for engineering include, but are not limited to, those 

monocotyledonous and dicotyledonous plants, such as crops including grain crops 
(e.g., wheat, maize, rice, millet, barley), fruit crops (e.g., tomato, apple, pear, 
strawberry, orange), forage crops (e.g., alfalfa), root vegetable crops (e.g., carrot, 
potato, sugar beets, yam), leafy vegetable crops (e.g., lettuce, spinach); flowering 

25 plants (e.g., petunia, rose, chrysanthemum), conifers and pine trees (e.g., pine fir, 
spruce); plants used in phytoremediation (e.g., heavy metal accumulating plants); oil 
crops (e.g., sunflower, rape seed) and plants used for experimental purposes (e.g., 
Arabidopsis). Thus, the invention has use over a broad range of plants, including, but 
not limited to, species from the genera Asparagus, Avena, Brassica, Citrus, CitruUus, 

30 Capsicum, Cucurbita, Daucus, Glycine, Hordeum, Lactuca, Lycopersicon, Malus, 
Manihot, Nicotiana, Oryza, Persea, Pisum, Pyrus, Pranus, Raphanus, Secale, 
Solanum, Sorghum, Triticum, Vitis, Vigna, and Zea. 
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One of skill in the art will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced 
into other plants by sexual crossing. Any of a number of standard breeding 
techniques can be used, depending upon the species to be crossed. 
5 A transformed plant cell, callus, tissue or plant may be identified and isolated 

by selecting or screening the engineered plant material for traits encoded by the 
marker genes present on the transforming DNA. For instance, selection may be 
performed by growing the engineered plant material on media containing an 
inhibitory amount of the antibiotic or herbicide to which the transforming gene 

10 construct confers resistance. Further, transformed plants and plant cells may also be 
identified by screening for the activities of any visible marker genes (e.g., the 
p-glucuronidase, luciferase, B or CI genes) that may be present on the recombinant 
nucleic acid constructs of the present invention. Such selection and screening 
methodologies are well known to those skilled in the art. 

1 5 Physical and biochemical methods also may be used to identify plant or plant 

cell transformants containing the gene constructs of the present invention. These 
methods include but are not limited to: 1) Southern analysis or PGR amphfication for 
detecting and determining the structure of the recombinant DNA insert; 2) Northem 
blot, SI RNase protection, primer-extension or reverse transcriptase-PCR 

20 amplification for detecting and examining RNA transcripts of the gene constructs; 3) 
enzymatic assays for detecting enzyme or ribozyme activity, where such gene 
products are encoded by the gene construct; 4) protein gel electrophoresis. Western 
blot techniques, immunoprecipitation, or enzyme-hnked immunoassays, where the 
gene construct products are proteins. Additional techniques, such as in situ 

25 hybridization, enzyme staining, and immunostaining, also may be used to detect the 
presence or expression of the recombinant construct in specific plant organs and 
tissues. The methods for doing all these assays are well known to those skilled in the 
art. 

Effects of gene manipulation using the methods of this invention can be 
30 observed by, for example, northem blots of the RNA (e.g., mRNA) isolated firom the 
tissues of interest. Typically, if the amount of mRNA has increased, it can be 
assumed that the endogenous dwf4 gene is being expressed at a greater rate than 
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before. Other methods of measuring DWF4 activity can be used. For example, cell 
length can be measured at specific times. Because dwf4 affects the BR biosynthetic 
pathway, an assay that measures the amount of BL can also be used. Such assays are 
known in the art. Different types of enzymatic assays can be used, depending on the 
5 substrate used and the method of detecting the increase or decrease of a reaction 
product or by-product. In addition, the levels of DWF4 protein expressed can be 
measured immimochemically, i.e., ELISA, RIA, EIA and other antibody based assays 
well known to those of skill in the art, by electrophoretic detection assays (either with 
staining or western blotting), and sterol (BL) detection assays. 

1 0 The transgene may be selectively expressed in some tissues of the plant or at 

some developmental stages, or the transgene may be expressed in substantially 
all plant tissues, substantially along its entire life cycle. However, any combinatorial 
expression mode is also applicable. 

The present invention also encompasses seeds of the transgenic plants 

1 5 described above wherein the seed has the transgene or gene construct. The present 
invention further encompasses the progeny, clones, cell Hues or cells of the transgenic 
plants described above wherein said progeny, clone, cell line or cell has the transgene 
or gene construct. 

20 Polvpeptides 

The present invention also includes DWF4 polypeptides, including such 
polypeptides as a fusion, or chimeric protein product (comprising the protein, 
fragment, analogue, mutant or derivative joined via a peptide bond to a heterologous 
protein sequence (of a different protein)). Such a chimeric product can be made by 

25 ligating the appropriate nucleic acid sequences encoding the desired amino acid 

sequences to each other by methods known in the art, in the proper coding frame, and 
expressing the chimeric product by methods commonly known in the art. 

As noted above, DWF4 phenotype includes any macroscopic, microscopic or 
biochemical changes which are characteristic of over- or imder-expression of dwf4. 

30 Thus, DWF4 polypeptide phenotype (e.g., activities) can include any activity that is 
exhibited by the native DWF4 polypeptide including, for example, in vitro, in vivo. 
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biological, enzymatic, immunological, substrate binding activities, etc. Non-limiting 
examples of DWF4 activities include: 

(a) activities displayed by other heme-thiolate enzymes; 

(b) characteristic Soret absorption peak at 40 nm when the substrate-bound 
5 reduced form is exposed to the hghts {see, e.g., Jefcoate et al, infra)\ 

(c) hydroxylation of various substrates via monooxygenase activity, which 
utiUzes molecular oxygen and reducing equivalents from NAD(P)H; 

(d) oxidation, dealkylation, deaminoation, dehalogenation, and sulfoxide 
formation that are involved in a variety of biological events in plants and animals 

10 (e.g., catabolism, anabolism, and xenobiotic activities); 

(e) recognition of at two substrates: campestanol (CN) and 6- 
deoxocastasterone (6-deoxoCS); 

(f) 22a-hydroxylase activity; 

(g) DWF4 phenotypic activities such as modulation of cell length, periods of 
15 flowering, branching, seed production, leaf size, and sterol composition in a plant; 

(h) regulation of gibberellic acid, cytokinins and/or auxin; 

(i) induce resistance to plant pathogens {see, e.g., U.S. Patent No. 5,952,545); 
0) accelerating growth at low temperatures; and 

(k) accelerating growth in dark conditions. 
20 A DWF4 analog, whether a derivative, fragment or fusion of native DWF4 

polypeptides, is capable of at least one DWF4 activity. Preferably, the analogs exhibit 
at least 60% of the activity of the native protein, more preferably at least 70% and 
even more preferably at least 80%o, 85%, 90%o or 95% of at least one activity of the 
native protein. 

25 Further, such analogs exhibit some sequence identity to the native DWF4 

polypeptide sequence. Preferably, the variants will exhibit at least 35%, more 
preferably at least 59%>, even more preferably 75%» or 80%* sequence identity, even 
more preferably 85% sequence identity, even more preferably, at least 90% sequence 
identity; more preferably at least 95%, 96%, 97%, 98% or 99% sequence identity. 

30 DWF4 analogs can include derivatives with increased or decreased activities 

as compared to the native DWF4 polypeptides. Such derivatives can include changes 
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within the domains, motifs and/or consensus regions of the native DWF4 polypeptide, 
which are described in detail in Example 3. 

Once class of analogs is those polypeptide sequences that differ from the 
native DWF4 polypeptide by changes, insertions, deletions, or substitution; at 
5 positions flanking the domain and/or conserved residues. For example, an analog can 
comprise (1) the domains of a DWF4 polypeptide and/or (2) residues conserved 
between the DWF4 polypeptide and other cytochrome P450 proteins, for example as 
shown in Figure 3 and described in Example 3. 

Another class of analogs includes those that comprise a DWF4 polypeptide 
1 0 sequence that differs from the native sequence in the domain of interest or conserved 
residues by a conservative substitution. For example, an analog that exhibits 
increased sterol binding can have optimized sterol binding domain sequences that 
differ from the native sequence. 

Yet another class of analogs includes those that lack one of the in vitro 
1 5 activities or structural features of the native DWF4 polypeptides, for example, 
dominant negative mutants or analogs that comprise a heme-binding domain but 
contain an inactivated steroid binding domain. 

DWF4 polypeptide fragments can comprise sequences from the native or 
analog sequences, for example fragments comprising one or more of the following 
20 P450 domains or regions: A, B, C, D, anchor binding, and proUne rich. Such domains 
and regions are shown in Figures 2B, 3 and described in Example 3. 

Fusion polypeptides comprising DWF4 polypeptides (e.g., native, analogs, or 
fragments thereof) can also be constructed. Non-Kmiting examples of other 
polypeptides that can be used in ftision proteins include chimeras of DWF4 
25 polypeptides and fragments thereof; and P450 polypeptides or fragments thereof, such 
as those shown in Figure 3. 

In addition, DWF4 polypeptides, derivatives (including fragments and 
chimeric proteins), mutants and analogues can be chemically synthesized. See, e.g., 
Clark-Lewis et al. (1991) Biochem. 30:3128-3135 and Merrifield (1963) J. Amer. 
30 Chem. Soc, 85:2149-2156. For example, DWF4, derivatives, mutants and analogues 
can be synthesized by solid phase techniques, cleaved from the resin, and purified by 
preparative high performance liquid chromatography (e.g., see Creighton, 1983, 
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Proteins, Structures and Molecular Principles, W. H. Freeman and Co., N.Y., pp. 
50-60). DWF4, derivatives and analogues that are proteins can also be synthesized by 
use of a peptide synthesizer. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
5 procedure; see Creighton, 1983, Proteins, Structures and Molecular Principles, W. H. 
Freeman and Co., N.Y., pp. 34-49). 

Further, the dwf4 polynucleotides and DWF4 polypeptides described herein 
can be used to generate antibodies that specifically recognize and bind to the protein 
products of the dwf4 polynucleotides, (See, Harlow and Lane, eds. (1988) 
10 "Antibodies: A Laboratory Manual"). The DWF4 polypeptides and antibodies thereto 
can also be used in standard diagnostic assays, for example, radioimmunoassays, 
ELISA (enzyme linked immunoradiometric assays), "sandwich" immunoassays, 
immunoradiometric assays, in situ immunoassay, western blot analysis, 
immunoprecipitationassays, immunofluorescent assays and PAGE-SDS. 

15 

Applications 

The present invention finds use in various applications, for example, including 
but not limited to those hsted above. 

The polynucleotide sequences may additionally be used to isolate mutant dwf4 
20 gene alleles. Such mutant alleles may be isolated firom plant species either known or 
proposed to have a genotype which contributes to altered plant morphology. 
Additionally, such plant dwf4 gene sequences can be used to detect plant dwf4 gene 
regulatory (e.g., promoter or promoter/enhancer) defects which can affect plant 
growth. 

25 The molecules of the present invention can be used to provide plants with 

increased seed and/fiiiit production, extended flowering periods and increased 
branching. The molecules described herein can be used to alter the sterol composition 
of a plant, thereby increasing or reducing cholesterol content in the plant. A still 
fiirther utility of the molecules of the present invention is to provide a tool for 

30 studying the biosynthesis of brassinosteriods, both in vitro and in vivo. 

The dwf4 gene of the invention also has utility as a transgene encoding a 
cytochrome P450 protein that mediates multiple 22a hydroxylation steps in 
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brassinosteriod biosynthesis which results in a transgenic plant to alter plant structure 
or morphology. The dwf4 gene also has utility for encoding the DWF4 protein in 
recombinant vectors which may be inserted into host cells to express the DWF4 
protein. Further, the dwf4 polynucleotides of the invention may be utihzed (1) as 
5 nucleic acid probes to screen nucleic acid libraries to identify other enzymatic genes 
or mutants; (2) as nucleic acid sequences to be mutated or modified to produce DWF4 
protein variants or derivatives; (3) as nucleic acids encoding 22a-hydroxylase in 
molecular biology techniques or industrial applications commonly known to those 
skilled in the art. 

1 0 The dwf4 nucleic acid molecules may be used to design plant dwf4 antisense 

molecules, useful, for example, in plant dwf4 gene regulation or as antisense primers 
in amplification reactions of plant dwf4 gene nucleic acid sequences. With respect to 
plant dwf4 gene regulation, such techniques can be used to regulate, for example, 
plant growth, development or gene expression. Further, such sequences may be used 

15 as part of ribozyme and/or triple helix sequences, also useful for dwf4 gene regulation. 
The dwf4 control element (e.g., promoter) of the present invention may be 
utilized as a plant promoter to express any protein, polypeptide or peptide of interest 
in a transgenic plant. In particular, the dwf4 promoter may be used to express a 
protein involved in brassinosteriod biosynthesis. 

20 The Arabidopsis DWF4 protein of the invention can be used in any 

biochemical applications (experimental or industrial) where 22a-hydroxylase activity 
is desired, for example, but not limited to, regulation of BL synthesis, regulation of 
other sterol synthesis, modification of elongating plant structures, and experimental or 
industrial biochemical applications known to those skilled in the art. 

25 

EXPERIMENTAL 
Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. 
30 Efforts have been made to ensure accuracy with respect to numbers used {e.g. , 

amounts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 
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Example 1: Materials and Methods 

A. Plant Growth Conditions 

5 The conditions used for plant growth were essentially as described previously 

(Feldmann (1991) Plant 1 1 :71-82 ; Forsthoefel et al. (1992) Aust J. Plant Physiol 
19:353-366), except that agar-solidified medium contained 0.5% sucrose. Seedlings 
up to 2 weeks of age (6 weeks of age for dark-growth experiments) were grown on 
0.8% agar-soUdified medium containing 1 x Murashige and Skoog 1962 salts 

10 (Murashige, T., and Skoog, F. (1962) Physiol Plant. 15:473-497) and 0.5% sucrose 
(w/v) and cold treated (4°C) for 2 days in the dark before transfer to the light (24 hr 
light; 80 \\mo\ m"^ sec"^); older plants were grown in potting soil. The plates were 
sealed with Parafilm (American National Can Co., Chicago, IL) for the entire 
experiment. For nucleic acid extraction, genetic analysis, and other experiments in 

1 5 which mature plants were required, seeds were sown on Metromix 350 (Grace Sierra, 
Milpitas, CA) presoaked with distilled water. The pots were covered with plastic 
wrap and cold treated (4X) for two days before transfer to a growth chamber (16:8, 
light [240 jLimol m'^ sec"^]:dark; 22 and 21 X, respectively, and 75 to 90% humidity). 
The plastic wrap was removed 5 days after germination, and the pots were 

20 subirrigated in distilled water as required. Germination of seeds for dark growth 
experiments was induced by overnight exposure of the seeds to light immediately 
after removing the plates from incubation at 4°C. The dwf4-l and dwf4'2 mutations 
were in the Arabidopsis thaHana ecotype Wassilewskija (Ws-2) background; the 
dwf4-3 and dwf4-4 mutations were in the Enkheim (En-2) background. 

25 

B. Analytical Methods 

Protoplasts were obtained by overnight incubation of shced leaves in 0.1% 
cellulysin, 0.1% driselase, 0.1% macerase (Calbiochem, San Diego, CA) in 125 mM 
Mes, pH 5.8, 0.5 M mannitol, and 7 mM CaC12 (Galbraith et al. (1992) Planta 
30 1 86:324-326 . Immediately before observation, chloroplasts were stained with a 

solution of 1.5% KI and 1%) Measurements were performed as described for tissue 
sections, and plane areas were calculated according to the formula A = Tcr^. 
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Chlorophyll determinations were performed from 2-week-old soil-grown 
plants. Green tissue was weighed, frozen in liquid nitrogen, and extracted in dim light 
with 80% acetone in the presence of a mixture of equal parts sand, NaHC03, and 
Na2S04. After brief centrifugation, the supernatant was collected and the extraction 
5 was repeated twice, pooling the supematants from each sample. Chlorophylls a and b 
were measured spectrophotometrically, as described in Chory et al. (1991), supra. 

C. Growth Signal Response Measurements 

Gibberellic acid (OA) response was assayed on plants grown individually in 
10 5.7-cm pots. Once inflorescences reached 1 to 2 mm in height, they were sprayed 
weekly with 1 mM GAS (Sigma). Control plants were sprayed with water. One week 
after the third spraying, plants were collected, and the length of the main stem was 
measured between the top of the rosette and the base of the most distal pedicel; 13 to 
18 plants of each line were measured per treatment. Auxin response was tested by 
15 growing seedlings for 10 days under 16 hr of light on vertically oriented agar plates 
containing various concentrations of 2,4-D (Gibco, Grand Island, NY). Genetic 
interaction with the hy2 mutation was tested by growing seedlings under continuous 
light for 7 days. Brassinolide (BL) response was determined in hquid culture, as 
described by Clouse et al. (1993), supra, except that three or four seedlings were 
20 grown in each well of a 24-well culture plate for 7 days. Measurements were taken for 
10 to 20 seedlings for each genotype and condition, under a dissection microscope 
fitted with an ocular micrometer. 

D, Microscopy 

25 Tissues were fixed in 2% glutaraldehyde and 0.05 M sodium cacodylate, pH 

6.9, for 2 hr at room temperature or ovemight at 4''C, followed by three washes in 
buffer. For hght microscopy, 1% safranin was included in the first wash, and 
embedding was performed in Paraplast Plus (Oxford Labware, St. Louis, MO). 
Ten-millimeter sections from five individual plants per line were analyzed and 

30 photographed, and cell measurements were taken using a ruler on 5 x 7 inch prints. A 
print of a hemocytometer grid at the same final magnification was used for 
calibration. At least 25 cells were measured per sample, with a minimum of 150 cells 
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per line. For electron microscopy, the tissues were treated after fixation with 1% 
tannic acid in buffer for 30 min, washed three times, and postfixed in 1% OSO4 in 
buffer for 2 hr, followed by five washes and dehydration through an ethanol series. 
Samples for transmission electron microscopy were embedded in Spurr's resin. 
5 Sections (90 nm) were stained with saturated uranyl acetate followed by Reynolds's 
lead citrate (Reynolds (1963) J. Cell Biol 17:208-212) and examined in a JEOL 
(Tokyo, Japan) 100-CX instrument. For scanning electron microscopy, samples were 
transferred to freon 113, critical point dried, and sputter-coated with 30 to 50 nm of 
gold. Analysis was performed in a microscope (ISI model DS130; Topcon, Inc., 
1 0 Paramus, NJ) with an accelerating voltage of 15 kV. Electron microscopy was 

performed at the Electron Microscope FaciUty, Division of Biotechnology, Arizona 
Research Laboratories, University of Arizona. 

Example 2: Isolation of dwf4 gene 

15 

A. Isolation of the DWF4 Gene 

The dwf4-l mutation was identified in a screen of 14,000 transformants of 
Arabidopsis, resulting in a dwarfed phenotype similar to dwfl (Feldmann and Marks 
(1987) Ma/. Gen. Genet 208:1-9; Feldmann et al. (1989) Science 243:1351-1354; 

20 referred to as diminuto in Takahashi et al. (1995) Genes Dev. 9:97-107 and Szekeres 
et al., supra) and det2 (Azpiroz et al. (1998), supra). Two independent lines were 
found that segregated for a similar phenotype: both were shorter than dwfl, but their 
rosette diameter was comparable to that mutant. These dwarfs were also essentially 
infertile. The most striking aspect of the morphology of these mutants is their 

25 similarity to det2 (Chory et al. (1991) Plant Cell 3 :445-459). For this reason, further 
analysis was conducted with these Unes. After being found to be allelic to each other, 
both were designated as dwf4. 

dwf4-l segregated for a single kanamycin resistance marker, and gel blot 
analysis with DNA from single plants of this family confirmed that the pattern is 

30 consistent with a single insert. The dwf4 mutation was subsequently shown to be 
inherited as a monogenic, recessive Mendehan trait that, in dwf4-l, cosegregates with 
the dominant kanamycin resistance marker contained in the T-DNA, suggesting that 
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the mutation in this Une may be a disrupted, tagged allele. dwf4-2 also contains a 
single kanamycin resistance marker, but it failed to cosegregate with the dwarf 
phenotype. Two additional alleles {dwf4-3 and dwf4-4) were identified among dwarf 
mutants obtained from the Nottingham Arabidopsis Resource Centre (Nottingham, 
5 UK; N365 and N374). Unless otherwise indicated, all experiments presented below 
were performed with dwf4-L 

Standard molecular techniques were performed as described previously 
(Sambrook et al. 1989). The plant DNA flanking the T-DNA was cloned using the 
plasmid rescue technique as described by Dilkes and Feldmann (1998) "Cloning 

10 genes from T-DNA tagged mutants" in Methods in Molecular Biology: Arabidopsis 
Protocol, J. Martinez-Zapater and J. Salinas, eds (Totowa, NJ: Hxxmana Press), pp. 
339-351. Briefly, dwf4-l genomic DNA was digested with EcoRI (for the right 
border) or Sail (for the left border), ligated under conditions to maximize 
intramolecular events, and introduced into competent Escherichia coli cells. The 

15 resulting colonies were screened on ampicillin. Five colonies from the left border 
transformation contained plant DNA flanking the insertion site. The restriction 
pattern displayed two different types of plant DNA. Three contained a 5.6-kb insert, 
whereas the other two contained a 1 . 1 -kb insert. This result suggested that the 
T-DNA insert in dwf4-l was flanked by two left border sequences. The existence of 

20 two left border sequences was confirmed by gel blot analysis with genomic DNA, 

using the putative plant flanking DNAs as probes. A single wild-type EcoRI fragment 
was split into two fragments in dwf4-L 

Wild-type genomic clones were isolated from a library made from Ws-2 
DNA by using the 5.6-kb fragment as a probe. The library was constructed using X 

25 DASH-II arms (Stratagene, La JoUa, CA). Approximately 10,000 primary plaques 
were screened. Duplicate-filter screening resulted in 12 positives. Restriction 
mapping of the secondary clones revealed that some contained part of the DWF4 
locus. In fact, one of the clones, D4G12-1, contained an intact 13-kb DNA spanning 
the T-DNA insertion site. The 13-kb insert in D4G12-1 was subcloned into 

30 pBluescript SK- (Stratagene). Subclones were sequenced from each end of the insert 
by using the universal primers in the plasmid. DNA sequencing was performed using 
an ABI 377 (Perkin-Ehner, Norwalk, CT ) automated sequencer at the Arizona 
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Research Laboratories (Tucson, AZ). 

Reverse transcriptase-polymerase chain reaction (RT-PCR) was used to isolate 
a cDNA clone. RNA was isolated from 5 -day-old dark- and light-grown seedlings. 
Superscript II reverse transcriptase (BRL, Gaithersburg, MD) was used for the cDNA 
5 synthesis, according to the manufacturer's protocol. Briefly, 7 \ig of total RNA was 
mixed with the reverse primer, D4R3. To the heat-denatured RNA-primer mix, the RT 
mixture was added and incubated for 1 hr at 43 °C. Two microliters of RT product 
was used for PCR ampHfication by using different primers sets intended to cover all 
of the putative coding region. RT-PCR products were fractionated on an 0.8% 
10 agarose gel (Sambrook et al. 1989); the expected bands were purified using a 
Geneclean kit (BIO 101, Inc., Vista, CA), fiirther amplified, and sequenced to 
determine the coding region. 

B. Sequencing 

1 5 dwf4-2 was isolated from a T-DNA mutant population as an untagged allele, 

whereas dwf4-3 and dwf4-4 were obtained from plants obtained from the Nottingham 
Arabidopsis Stock Centre (University of Nottingham, UK; stock nos. N365 and 
N374); the mutagenesis method for these two lines is not known. Based on the DNA 
sequence of wild-type genomic DNA, pairs of primers were designed to ampUfy 

20 1-kb stretches of genomic DNA. Oligonucleotide sequences are shown 5' to 3\ The 
numbers shown correspond to positions in the genomic sequence, with the adenine 
base in the translation initiation codon set as position 1. D40VERF, 
l-ATGTTCGAAACAGAGCATCATACT-24 (SEQ ID N0:3); D4PRM, 
(-l)-CCTCGATCAAAGAGAGAGAGA-(-21) (SEQ ID N0:4); D4RTF, 

25 143-TTCTTGGTGAAACCATCGGTTATCTTAAA-171 (SEQ ID N0:5); D4RTR, 
853-TATGATAAGCAGTTCCTGGTAGATTT-828 (SEQ ID N0:6); D4F1, 
(-242)-CGAGGCAAC-AAAAGTAATGAA-(-222) (SEQ ID N0:7); D4R1, 
689-GTTAGAAACTCTAAAGATTCA-669 (SEQ ID N0:8); D4F2, 
576-GATTCTTGGCAACAAAACTCTAT-598 (SEQ ID N0:9); D4R2, 

30 1685-CCGAACATCTTTGAGTGCTT-1666 (SEQ ID NO:10); D4F3, 

1606-GTGTGAAGGTTATAAATGAAACTCTT-1631 (SEQ ID N0:1 1); D4R3, 
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3156-GGTTTAATAGTGTCGACACTAATA-3132 (SEQ ID N0:12); D4F4, 
2316-CCGATGACTTGTACGTGCGTTA-2337 (SEQ ID N0:13); D4F5, 
730-GCGAAGCATATAATGAGTATGGAT-753 (SEQ ID NO: 14); and D4R5, 
1876-GTTGGTCATAACGAGAATTATCCAAA-1851 (SEQ ID NO: 15). Because 
5 the two stock center lines were in a different genetic background than the wild-type 
gene that we had sequenced (WS), primers were based primarily on the exon sequence 
to avoid sequence variation between introns. Genomic DNA isolated from the 
mutants was subjected to PGR, using these primer sets. The amplified DNA 
fragments were fractionated on 0.8% TAE agarose gel (Sambrook et al. 1989), 
10 purified using Geneclean (BIO 101, Inc.) or Qiaquick™ columns (Qiagen Inc., 

Chatsworth, CA), and sequenced. Putative mutations were identified by comparing 
the mutant DNA sequence with the wild-type sequence. The sequence was confirmed 
by sequencing independently ampUfied fragments at least three times for each 
mutation to eliminate PGR misincorporation. 

15 

C. Sequence Analysis 

Annotations in multiple sequence ahgnment were performed using the 
ALSCRIPT package provided by Barton, G.J. (1993) Protein Eng. 6:37-40. Searches 
for similar protein sequence were performed with the BLAST program (Altschul et al. 

20 (1990), supra). In addition, usefiil packages, available on the internet, such as 

promoter, protein targeting, polyadenylation site, and splice site, have been employed 
to characterize the DNA and protein sequence (consolidated in the search launcher, 
Baylor College of Medicine, Baylor, TX). All other sequence analysis was performed 
using the Genetics Computer Group (Madison, WI) software package. 

25 Analysis of the complete genomic sequence, starting at the EcoRI site, with 

the promoter prediction by neural network (NNPP) package 
(http://www-hgc.lbLgov./projects/promoter.html), indicated that the gene included a 
putative promoter (TAT AT is found in the putative promoter region between 
nucleotides -143 to -78) and polyadenylation signal sequences (AATAA near a 

30 position at 3238 bp and a putative GU-rich signature from 3283 to 3290 bp). 

Unsuccessfiil attempts to detect mRNA by tissue-specific RNA gel blot 
analysis, using the 4.8-kb fragment as a probe, suggested that DWF4 encoded a rare 
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message. In addition, there were no matching expressed sequence tags in the 
Arabidopsis database. Therefore, we screened two different cDNA libraries made 
with either normahzed mRNA from different tissues or RNA from floral tissues, 
using the 4.8-kb fragment as a probe (ABRC stock numbers CD4-7 and CD4-6, 
5 respectively). After finding no positives in 109 clones screened, we chose to directly 
amplify DWF4 cDNA from total RNA made from 5-day-old seedlings, using reverse 
transcriptase-polymerase chain reaction (RT-PCR). Whereas RNA from both 
Ught-grown and dark-grown seedlings yielded the expected RT-PCR products, RNA 
from dark-grown seedlings generated significantly more. The bands were gel purified 

10 and sequenced. Ahgnment of the genomic and cDNA sequences indicated that the 
DWF4 gene was composed of eight exons and seven introns (Figure 2 A; Figure 10). 

Sequence analysis of the dwf4-l allele revealed that the T-DNA was inserted 
in the 5' end of intron 7 (Figure 2A). In addition, sequence analysis of the left border 
plant jxmctions indicated that at one junction (5'), 75 bp of unknown DNA was 

15 inserted, whereas at the other junction (3'), 24 bp of left border and 19 bp of plant 
DNA were deleted. To prove that DWF4 had been cloned, two other dwf4 alleles 
{dwf4~2 and dwf4-3) were sequenced to identify possible lesions. As shown in Figure 
2B, dwf4-2 contained a deletion of three conserved amino acids (324 to 326) caused 
by a 9-bp deletion, and dwf4S contained a premature stop codon (289) caused by 

20 changing a tryptophan codon (UGG) to a nonsense codon (UGA). Due to a premature 
stop codon, translation is predicted to be terminated before the heme binding domain, 
which is essential for cytochrome P450 function (Poulos et al. (1985) J. Biol. Chem. 
260:16122-16130). Because T-DNA-generated alleles dwf4-l and dwf4'-2 and an 
additional mutant allele all possess loss-of-function mutations affecting the same 

25 protein, we conclude that we have cloned the DWF4 gene. 

Example 3: The DWF4 Gene Encodes a Cytochrome P450 

The open reading frame of DWF4 encodes a protein composed of 513 amino 
acids. BLAST database searches (Altschul et al. (1990) MoL Biol 215:403-410) 
30 for similar sequences yielded a superfamily of cytochrome P450 proteins as 
significant high-scoring segment pairs. Cytochrome P450s are heme-thiolate 
enzymes. They display a characteristic Soret absorption peak at 450 nm when the 
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substrate-bound, reduced form is exposed to the light (Jefcoate (1978) "Measurement 
of substrate and inhibitor binding to microsomal cytochrome P-450 by 
optical-difference spectroscopy" in Methods in Enzymology, Vol. 52, S. Fleischer and 
L. Packer, eds (London: Academic Press), pp, 258-279), Typical microsomal 
5 cytochrome P450s hydroxylate various substrates via their monooxygenase activity, 
which utilizes molecular oxygen and reducing equivalents from NAD(P)H. In 
addition to the hydroxylation, other activities of cytochrome P450 enzymes, such as 
oxidation, dealkylation, deamination, dehalogenation, and sulfoxide formation, are 
involved in a variety of biological events in catabolism, anabolism, and xenobiotic 

10 metabolism in plants as well as animals (reviewed in West (1980) "Hydroxylases, 
monooxygenases, and cytochrome P-450" in The Biochemistry of Plants: A 
Comprehensive Treatise, Vol. 2, Metabolism and Respiration, D.D. Davies, ed (New 
York: Academic Press), pp. 317-365; Nebert and Gonzalez (1987), supra; Guengerich 
(1990) Crit Rev. Biochem, Mol BioL 25:97-152, Guengerich (1993) Am. ScL 

15 81:440-447; Durst (1991) "Biochemistry and physiology of plant cytochrome P-450" 
in Microbial and Plant Cytochromes P-450: Biochemical Characteristics, Genetic 
Engineering and Practical Implications, K. Ruckpaul and H. Rein, eds (London: 
Taylor and Francis), pp. 191-232; Bolwell et al. (1994) Phytochemistry 
37:1491-1506; Durst and Nelson (1995), supra\ Schuler (1996) CRC Crit, Rev, Plant 

20 Sci, 15:235-284). Evolutionarily, cytochrome P450s have been found in a broad 
spectrum of living organisms, and they share significant homology at the amino acid 
sequence level. Thus, it has been proposed that all known cytochrome P450s were 
derived from a common ancestor (Nelson and Strobel (1987) Mol Biol EvoL 
4:572-593). 

25 Typical cytochrome P450s contain four characteristic domains as defined by 

Kalb and Loper 1988. Of the four domains, A, B, C, and D, at least two of them have 
been assigned specific functions. Domain A binds a substrate and molecular oxygen, 
and domain D has been shown to bind heme-prosthetic groups via a thiolate bond 
(Poulos et al. 1985). Thus, typically, microsomal cytochrome P450 enzymes can be 

30 identified by their characteristic signature sequences, including the heme binding 

domain, domain A (also referred to as dioxygen binding), domain B (steroid binding), 
and domain C (Nebert and Gonzalez (1987) Annu. Rev. Biochem. 56:945-993; Kalb 
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and Loper (1988) Proc. Natl Acad. Sci USA 85:7221-7225). All of these signature 
sequences were found in DWF4; the relative positions of the domains are indicated in 
Figure 2B. 

Durst and Nelson (1995) DrugMetab. Drug Interact. 12:189-206 classified 
5 plant cytochrome P450s into two distinct groups based on their clustering nature in a 
phylogenetic tree. All of the group A families cluster and are assumed to originate 
from a common plant P450 ancestor. The group A cytochrome P450s conform to the 
characteristic consensus sequences (A/G)GX(D/E)T(T/S) in domain A (also called 
helix I) and PFG(A/SA^)GRRXC(P/AA^)G of the heme binding domain (D) with only 

10 a few exceptions. Group A cytochrome P450s appear to catalyze plant-specific 

reactions such as lignin biosynthesis (Figure 6; GenBaiik accession number P48421). 
By contrast, P450s that do not belong to group A (non-A P450s) are scattered in the 
phylogenetic tree. They share more amino acid identity/similarity with P450s found 
in animals, microbes, and fungi than with those found in plants. The non-A P450s 

15 possess functions, such as steroid metabolism, that are not limited to plants. 

Generally, non-A P450s have hmited homology with known domains described for 
group A. 

The most similar protein to DWF4 is the Arabidopsis CPD protein, a non-A 
P450. A mutation in CPD also caused dwarfism (Szekeres et al. 1996; CYP90A1, 

20 GenBank accession number X87368). DWF4 and CPD share 43% identity and 66% 
similarity. Conforming to the recommended nomenclature for cytochrome P450 
enzymes, DWF4 and CPD (CYP90A1) are grouped into the same family within 
different subgroups (Durst and Nelson (1995) Drug Metab. Drug Interact, 
12:189-206). As such, DWF4 represents a second member of the CYP90 family and 

25 is designated CYP90B 1 . Sequence similarity between the two proteins occurs 
throughout their length, with the greatest similarity in the classically conserved 
domains. Residues conserved between DWF4 and CYP90A are boxed and italicized 
in Figure 3, The second most similar protein is the tomato CYP85 (Bishop et al. 
(1996), supra; GenBank accession number U54770). A mutation in this gene also 

30 results in dwarfism. DWF4 and CYP85 share 35% identity and 59% similarity in their 
overall protein sequences. 
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Six cytochrome P450 sequences with the greatest homology to DWF4, 
CYP90A1, CYP85, CYP88 (Winkler and Helentjaris (1995) Plant Cell 7:1307-1317; 
GenBank accession number U325 79), cyanobacteria CYP120 (Kaneko et al. (1996) 
DNA Res. 3:109-136; GenBank accession number D64003), human CYP3A3X 
(Molowa et al. (1986) Proc. Natl. Acad. Sci. USA 83:531 1-5315; GenBank accession 
number Ml 3785), and zebrafish CYP26 (White et al. White (1996) J. Biol. Chem. 
271:29922-29927; GenBank accession number U68234), were chosen for multiple 
sequence alignment. Putative domains defined by Kalb and Loper (1988), supra are 
boxed and labeled in Figure 3. First, the heme binding domain pFGgFpRlCpGkel 
matches completely the sequence defined previously. Uppercase letters in the domain 
indicate amino acids conserved at all seven sequences in the aUgnment, and 
lower-case letters represent residues conserved in at least half of the proteins. Of the 
amino acids conserved in the heme binding domain, the function of the cysteinyl is 
established as a thiolate ligand to the heme (Poulos et al. (1985), supra). 

Domain A is defined by xllfaGhEttssxIxxa. Lowercase x's indicate variable 
amino acids. An invariant glutamate (E) preceded threonine (T) at position 3 14, 
T314, which is believed to bind dioxygen, was conserved in all proteins compared 
except CYP88 of maize. The second signature sequence, domain B, is also conserved 
in DWF4 with significant similarity. A valine at position 370 is conserved in all of 
the proteins, but it does not appear in Kalb and Loper's classic report (1988) on 
conserved domains. Again, DWF4 matches the domain C consensus sequence. 
Finally, the anchoring domain in the N-terminal end was distinguished by a repeat of 
the hydrophobic residue leucine. In addition, in DWF4, two acidic (glutamate) and 
two basic (histidine) residues precede the repeated leucine in the N-terminal leader 
sequence. These charged residues may add more stability to the membrane topology 
of the protein as a strong start-stop transfer peptide (von Heijne (1988) Biochim. 
Biophys. Acta 947:307-333). 

Thus, phylogenetic analyses of these seven proteins with cytochrome P450s 
unique to plants (group A; Durst and Nelson (1995), supra) indicate that DWF4 does 
not cluster with these cytochrome P450s (Figure 6). Rather, DWF4 clustered with 
cytochrome P450s firom other organisms: cyanobacteria (CYP120), rat (CYP3A2), 
human (CYP3A3X), and plants (CYP90, CYP85, and CYP88). DWF4 also deviates 
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from the consensus sequence in the group A heme binding domain in that it possesses 
a PFGGGPRLCAG sequence in which arginine (R) is substituted for proUne (P). 
However, domain A of DWF4, AGHETS, fits the consensus of domain A of group A. 
These characteristics suggest that DWF4 is a monooxygenase^ similar to P450s of 
5 group A, that utilizes molecular oxygen as a source of the hydroxyl group, but it 
mediates some reaction(s) that are not necessarily specific for plants, for instance, 
steroid hormone biosynthesis, which is a critical event for animals. In fact, the 
similarity of DWF4 to the rat testosterone 6P-hydroxylase (34%; GenBank accession 
number 631895) or glucocorticoid-inducible hydroxylase (31%; Molowa et al. 1986; 
10 GenBank accession number M13785) supports this idea. Further, the similarity that 
DWF4 shares with CYP90A and CYP85, 66 and 59%, respectively, is additional 
proof that it is involved in plant steroid biosynthesis (Bishop et al. 1996 ; Szekeres et 
al. 1996). 

1 5 Example 4: The dwf4 Phenotype 

As formally defined, a plant with a dwarf phenotype is one that has a short, 
robust stem and short, dark green leaves, dwf4 mutants are significantly smaller than 
the wild type and are dark green in color. They have short, rounded leaves. Again, the 
dwf4 phenotype is reminiscent of the light-regulatory mutant det2 (Chory et al., 

20 supra); however, complementation analysis has shown that the two mutations are not 
allelic, with the dwf4 mutation mapping to the lower arm of chromosome 3 and det2 
mapping to chromosome 2 (Chory et al., supra). The results presented in Table 1 
show that soil-grown d\yf4 plants attained a height of <3 cm at 5 weeks, whereas 
wild-type plants grew to >25 cm. Moreover, individual organs, such as leaves, were 

25 invariably shorter in dwarf plants. dwf4 siliques were also markedly shorter than those 
of the wild type and were infertile. The loss of fertility of dwf4 was due to the reduced 
length of the stamen filaments relative to the gynoecium, which resulted in mature 
pollen deposition on the ovary wall rather than on the stigmatic surface. Hand 
pollination of dwf4 flowers with either mutant or wild-type pollen resulted in good 

30 seed set without significantly changing the size of the siliques. 
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Table 1: The Development of Wild-Type and dwf4-l plants 


Measurement 


Wild-Type' 


dwf4-r 


Five Weeks 






Height 


25.8 ± 2.6 cm 


2.8 ± 0.3 cm 


Leaf blade length^ 


1.72 ±0.36 cm 


0.96 ±0.15 cm 


Leaf blade widttf 


0.77 ±0.10 cm 


0.99 ±0.18 


No. inflorescences 


3.6 ±0.5 


10.5 ±1.4 


No. rosettes 


7.1 ±0.9 


13.5 ± 1.3 


Other 






oidii yji. nvjwt'iiiig 


?1 5 davs 


25 9 dav<? 


length 


1 1 r=i ± 0 07 rm 




No. seeds per 
silique 


37.7 ±3.3 


0.0 


Final no. of siliques 


336.5 ± 90.6 


988.4 ±214.2 


Height at maturity 


27.0 ± 2.7 


1 1.6 ± 1.0 cm 



^ results shown are the average ±SD of measurements taken from 10 plants 
^ measurements taken from the second pair of leaves 



20 

Another feature of dwf 4 plants is a reduction in apical dominance, as was 
evident by the threefold increase in the number of inflorescences at 5 weeks of age 
(Table 1). Mutants also had twice the number of rosette leaves, which may be 
explained by a prolonged vegetative phase in the dwf4 plants. Development of flowers 

25 on the primary inflorescence was delayed by -^4 days in dwf4, but the flowering phase 
was significantly longer in the mutant, with senescence of the last flower occurring at 
-^98 days compared with -57 days for the wild type. One result of this delay in 
senescence was that dwf4 plants contained almost three times the number of siliques 
as did the wild type (Table 1). 

30 The reduced stature observed in soil-grown dwf4 was also observed in 

hypocotyls of agar-grown seedlings. Measurements of hypocotyl length over time 
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indicated that not only were dwf4 seedlings shorter than wild-type seedlings 
immediately after germination but also that the rate of growth was retarded in the 
mutants (Figure 5). In addition, dwf4 hypocotyls reached their terminal length in <5 
days, whereas wild-type seedlings continued to grow. 

5 In sum, the dwf4 phenotype can be described as being due to both primary and 

secondary effects of reduced cell elongation. The primary effect is simply a reduction 
in the length of individual organs exclusively along theh normal growth axis; that is, 
organ width is not reduced (Table 1). The secondary effects of reduced cell elongation 
are themselves due to the reduction in organ length. The dark green color of the 

1 0 leaves, for example, may be due exclusively to the existence of a wild-type number of 
chloroplasts in a significantly smaller cell. Similarly, the sterility of mutants is a 
consequence of the shortness of the stamens, which fail to deposit their pollen on the 
stigmatic surface. In addition to the morphological alterations ofdwf4, mutants 
display delayed development, the first sign of which occurs at flowering (Table 1). 

1 5 Because rosette leaves are produced continuously during vegetative development, 
delayed flowering results in dwf4 rosettes having ahnost twice the number of leaves 
observed in the wild type. 

Example 5: The Growth Defect of dwf4 Is Due to a Reduction in Cell Length 

20 Both the short stature and the reduced growth rate of dwf4 could be due to a 

defect in cell division or cell elongation or both. To distmguish between these 
possibilities, we analyzed sections from 7-day-old hypocotyls and 5-week-old 
inflorescence stems, by light microscopy, as described in Example 1. To minimize 
variations due to the developmental stage of the sample, we always took the stem 

25 sections from the fourth intemode. As shown in Table 2, the average cell size in dwf4 
is significantly smaller than in wild-type plants, whereas no differences were detected 
in the number of cells along the length of either organ between the wild type and 
dwf4. Therefore, the short stature and reduced organ length of dwf4 are largely or 
exclusively due to a failure of individual cells to elongate. No differences were 

30 observed in the number of cell layers contained in the wild type and dwf4. 
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Table 2: Cell Length in WUd-Type and dwf4 plants 


Measurement 


Wild-type 


dwf4 


average cell length in hypocotyl: 7 day old 
plant 


92.7 ^m 


32.2 ^im 


average cell length in stem: 5-week old plant 


79.2 ^m 


15.0 ^im 



The small size oidwf4 cells offers a possible explanation for the dark green 
color of the mutant plants. Chlorophyll measurements were taken, and leaf mesophyll 
protoplasts were prepared, stained, and measured to visuaUze and count chloroplasts, 
as described in Methods. Although there were no significant differences in total 
chlorophyll content, the chlorophyll a/b ratio, or the absorption spectra between 
wild-type plants and mutants, the mean plane area (the apparent two-dimensional 
surface area of mounted cells) oidwf4 leaf mesophyll protoplasts was 376 mm^ 
whereas that of wild-type protoplasts was 599 mml The two-dimensional comparison 
of plane area represents a dramatic reduction in volume for dwf4 cells. However, the 
number of chloroplasts per cell was only sHghtly lower: the mean number of 
chloroplasts per cell was 40 for dwf4 and 44 for the wild type. Therefore, dwf4 cells 
contain a greatly increased number of chloroplasts per unit cell volume. As a 
consequence, the chloroplasts are brought closer to each other, making the color of the 
leaves appear darker. Chloroplast size was the same in both Unes. 

Thus, the rate of growth was significantly reduced in agar-grown dwf4 
seedlings, which ceased to grow when their hypocotyl length was <20% of the final 
wild-type length. Because all of the cells in a hypocotyl before the initiation of leaf 
development are present in the embryo, the initial growth of seedlings is due 
exclusively to cell expansion, which therefore must be reduced in dwf4. A similar 
situation applies to soil-grown plants. Five weeks after germination, well after plants 
had bolted, dwf4 plants were shorter than wild-type plants (Table 1). Although the 
mutants continued growing for several weeks more than did the wild type, they 
remained shorter through senescence. That cell elongation is the direct cause of this 
decreased growth is shown by measurements of cell length both in 7-day-old 
hypocotyls (Table 2) and in 5-week-old stems (Table 2). Not only is the reduction in 
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cell length in good agreement with the reduction in organ length, but insofar as could 
be determined, there is no difference in the number of cells between dwf4 and 
wild-type plants. 

Organ growth by cell elongation in plants occurs as part of normal 
development in response to a variety of input signals. Mutants that are defective in 
these signaling pathways invariably fail to elongate normally in response to the 
appropriate stimuli. A mutant with a block at a step that is common to several 
individual pathways would therefore be expected to have defective responses to all of 
the corresponding signals. dwf4 appears to be such a mutant. Figure 6 shows that 
elongation induced by the hy2 mutation is blocked in a dwf4 hy2 double mutant. Not 
surprisingly, in view of this result, dwf4 also failed to display hypocotyl elongation as 
a response to growth in complete darkness. In addition, dwf4 was capable of 
perceiving GA, but its response was severely compromised. This mutant could also 
respond to the inhibitory effects of auxin but was incapable of auxin-stimulated 
elongation. It was only exogenous BL that fully restored wild-type length to dwf4 
hypocotyls (Choe et al. (1998), supra). 

Because dwf4 failed to respond to at least three independent signaling 
pathways but responded fully to only one, the most likely explanation for the dwarf 
phenotype is therefore that a fully functional BR system is required for a full response 
to GA, auxin, and deetiolation. From the perspective of cellular economy, it may be 
advantageous that the downstream elements involved in cell elongation are shared 
among at least some of the signaling pathways that evoke this response. The 
interaction of various pathways at a common step provides the plant with a potential 
point for the integration of signals produced by diverse independent stimuli. Our 
results indicate that BRs act at this downstream step. 

Example 6: dwf4 Is Specifically Rescued by Brs 

The reduced length of cells in dwf4 hypocotyls and inflorescence stems is 
indicative of a failure of these cells to elongate during development. A variety of 
endogenous and environmental signals is responsible for stimulating elongation in 
plants; therefore, a series of experiments was performed to determine whether dwf4 is 
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affected in a specific signaling pathway or is blocked in elongation as a response to 
various signals. 

Of the endogenous (hormonal) signals that might be deficient in dwarf plants, 
an obvious candidate is GA, because gibberellin-deficient mutants are shorter in 
stature than are the wild-type plants (Koomneef and Van der Veen, supra). Our 
results, however, indicate that dwf4 is not defective in the synthesis of gibbereUins. 
When germinated on 10"^ M GA, wild-type seedlings demonstrated an elongation 
response (Figure 6), whereas dwf4 seedhngs responded minimally, if at all. At 10""^ M 
GA, wild-type seedlings elongated shghtly more than at 10"^ M, but the dwf4 
seedlings were essentially saturated for elongation at 10"^ M GA. Similar results were 
obtained when soil-grown plants were sprayed with 1 mM GA once inflorescences 
first became visible: dwf4 inflorescence stems elongated by only 28% above the 
untreated controls, whereas those of the wild type elongated by 45% above the 
untreated controls. Mutants that owe their reduced stature to decreased levels of 
endogenous gibberellins can be fiilly rescued by added hormone (Koomneef and Van 
der Veen, supra\ Talon et al., supra). In addition, dwf4 seeds germinate in the absence 
of exogenously supplied GA. Our results therefore suggest that dwf4 is not deficient 
in endogenous GA. A corollary conclusion from this experiment is the demonstration 
that dwf4 is capable of detecting GA; that is, it is not likely to be affected in signal 
perception but rather is defective in the extent to which it can respond to this signal. 

Auxin can also stimulate cell elongation. This effect is especially visible in 
young seedlings (Klee and Estelle, supra). The response of wild-type and dwf4 plants 
to auxin was tested by growing seedlings for 10 days on vertically oriented plates 
containing various concentrations of the synthetic auxin 2,4-D. At all concentrations 
assayed, inhibition of root growth was evident. Figure 6 shows that at 10"^ M 2,4-D, 
hypocotyl elongation in wild-type and dwf4 seedlings was similar to that of the 
controls. Higher concentrations of auxin were inhibitory for both wild-type and dwf4 
seedlings, and lower concentrations had no effect. In view of the inhibition of root 
growth, it is clear that dwf4 is not auxin resistant; rather, its elongation response is 
compromised. 

As mentioned above, the most obvious exogenous signal for plants is light. 
Therefore, to investigate whether Ught-regulated cell elongation is altered in dwf4. 
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wild-type and dwf4 seedlings were grown in the dark, as described in Example 1. 
Figure 6 shows that as expected, wild-type seedlings displayed hypocotyl elongation 
typical of etiolated growth. By contrast, dark-grown dwf4 seedlings were only 
shghtly longer than those grown in the hght. To assess the relationship between the 
5 dwf4 phenotype and hght sensing by dwf4, the mutation was crossed into a mutant 
defective in the HY2 gene. All hy mutants share the common phenotype of an 
elongated hypocotyl that mimics part of the etiolation response in the hght. 
Specifically, hy2 is deficient in active phytochrome because chromophore 
biosynthesis does not take place (Chory et al (1989a) Plant Cell 1:867-880). Figure 6 
10 shows that dwf4 hy2 double mutants displayed a dwarfed phenotype indistinguishable 
fi-om that ofdwf4 HY2 (light-grown control); therefore, the elongation block due to 
the dwf4 mutation is epistatic to a defect in phytochrome activity. 

In the course of our studies, we prepared a genomic library from dwf4-l, from 
which we isolated a clone in which a fragment of T-DNA interrupts a gene encoding a 
1 5 putative cytochrome P450 steroid hydroxylase. Because BRs have been shown to 
elicit elongation in Arabidopsis (Clouse et al. (1993) /. Plant Growth ReguL 
12:61-66) and because BR-deficient mutants have been recently described 
(Kauschmann et al. (1996), supra; Li et al. (1996), supra; Szekeres et al., supra ), we 
tested the effect of BL on Arabidopsis seedlings by germinating seeds in liquid 
20 medium containing different amounts of BL. As shown in Figure 6, the dwf4 

hypocotyls were restored to wild-type height by 10"^ M BL. This, together with our 
identification of a disrupted gene encoding a putative BR biosynthetic enzyme, 
strongly suggests that the phenotype of dwf4 is specifically due to a defect in BR 
biosynthesis (see Choe et al. (1998) Plant Cell 10:231-243). 
25 Thus, the results indicate that BL is involved at or near a downstream control 

point where multiple signaling pathways interact. First, as shown in Figure 6, BL is 
required for cell elongation as a response to darkness as well as GA and auxin. In 
addition, previous studies (Kauschmann et al. (1996), supra; Li et al. (1996), supra; 
Szekeres et al. (1996), supra) and the work described herein show that BR can 
30 compensate for the cell elongation defect of mutants as diverse as det2, cpd, dwf4, 
detl, cop I, and dwfL This places BRs downstream of all the cellular functions 
affected in these mutants. Finally, at least one of the BR biosynthetic genes has been 
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shown to be modulated by light, cytokinins, and the carbon source (Szekeres et al. 
(1996), supra). 

Mutations in axr2 result in a dwarf growth habit and a dark-grown phenotype 
with short hypocotyl and open cotyledons (Timpte et al. (1992), supra). In addition, 
5 axr2 mutants are resistant to auxin, ethylene, and abscisic acid and have defective root 
and shoot gravitropism. The dwarf phenotype in axr2 mutants has been shown to be 
due to reduced cell elongation and is rescued by BL (Szekeres et al. (1996), supra). 
This suggests that at least one of the multiple hormone signaling pathways affected in 
axr2 involves a BR-dependent step. Mutations at another locus, acaulisl, also have a 

10 significant reduction in cell elongation, but the defect is confined to inflorescence 
stems and leaves (Tsukaya et al. (1993) Development 118:751-764). Flowers are fully 
fertile and mature into normal-sized sihques with normal seed set. There is no change 
in hypocotyl length. If BRs are directly involved in this apparently organ-specific 
signaling pathway, it may be due to organ-specific responsiveness to individual BR 

1 5 species. With regard to the mechanism of action of BRs, at the moment one can only 
speculate that the target may be a component of the cell expansion machinery. 
Perhaps steroid signaling initiates a series of events leading to cell wall loosening. 

Example 7: The Elongation Defect of dwf4 Leads to a Light-Regulatory 
20 Phenotype 

The BR-deficient mutant det2 was originally identified as defective in 
regulation by hght (Chory et al. (1991), supra). Given the similarity of det2 and dwf4 
phenotypes and fimctions and in view of the observation that dwf4 is epistatic to hy2, 
one can predict that the etiolation response, which includes significant hypocotyl 

25 elongation, would not be normal in dwf4. To assess to what extent the etiolation 
response is affected by BR-dependent cell elongation, we grew dwf4 and wild-type 
plants on agar under continuous hght or in complete darkness, as described above in 
Example 1 . After 7 days of growth in the light, wild-type seedlings displayed open 
and expanded cotyledons as well as emerging leaf buds. In contrast, the overall 

30 appearance of Hght-grown dwf4 seedlings was strikingly similar to that of det2 (Chory 
et al. (1991), supra), dwf4 hypocotyls were very short, and the cotyledons were 
smaller than those of the wild type, displaying significant epinastic growth. As 
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expected, dark-grown wild-type seedlings had a typical etiolated appearance, with a 
highly elongated hypocotyl and closed, unexpanded cotyledons. However, dwf4 
hypocotyls failed to elongate. That the dwf4 mutation can abolish the elongation 
component of the etiolation response is in agreement with the notion that the block in 

5 cell elongation in dwf4 is specifically a BR-dependent process. 

In addition to short hypocotyls, dark-grown dwf4 seedlings displayed partially 
open cotyledons and leaf primordia, with up to four leaf buds clearly visible. This has 
not been observed with the wild type, although it occurs with certain light-regulatory 
mutants (Chory et al. (1989b), supra\ Deng et al. (1991) Genes Dev. 5:1 172-1 182; 

10 Wei and Deng (1992), supra). dwf4 leaf development continued in the darkness for 
several weeks, resulting in significant expansion of rosette leaves. These results 
indicate that dwf4 plants can initiate what is normally a photomorphogenic pathway in 
the absence of fight. Although this is often diagnostic of a light-regulatory mutant, 
wild-type Arabidopsis can perform leaf development and even flowering in complete 

15 darkness when grown in liquid culture (Araki and Komeda (1993) Plant 1 
4:801-811). 

The cause for this dark-flowering effect is not understood; therefore, the 
possibility exists that leaf development in dark-grown dwf4 is related to dark 
flowering and not to a hght-regulatory defect. For example, perhaps the proximity of 

20 the dwf4 shoot apical meristem to the surface of the agar, due to the shortness of the 
hypocotyls, mimics some effect of submerged culture, such as a high water potential 
or a high concentration of some nutrient. To test this possibility, wild-type seedlings 
were grown in complete darkness for 6 weeks in vertically oriented dishes to 
maximize contact between the seedling and the medium. Wild-type seedlings grown 

25 in this fashion displayed open cotyledons and imderwent at least partial leaf 

development. In fact, all wild-type seedlings grown along the surface of the agar 
showed development of an inflorescence with at least one cauhne leaf and a terminal 
flower bud. We conclude, therefore, that the appearance of leaves in dark-grown dwf4 
may be due simply to its short size and the culture conditions. 

30 A number of light-regulatory mutants have been described that undergo 

photomorphogenesis in the dark at the cellular level. In mutants such as copl, cop8, 
cop9, cop 10, and copll, stomata undergo photomorphogenic maturation (Deng and 
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Quail (1992), supra ; Wei and Deng (1992), supra ; Wei at al. (1994), supra ); of 
these, copl and cop9 as well as detl (Chory et al. (1989b), supra) also initiate 
differentiation of plastids into chloroplasts. To determine whether dwf4 plants 
undergo photomorphogenic cellular differentiation in the dark, we analyzed 
5 cotyledons from light- and dark-grown plants by transmission and scanning electron 
microscopy. Analysis of plastids in thin sections from 7-day-old dark-grown 
seedlings showed no difference between the wild type and dwf4. Both lines contained 
normal chloroplasts when grown in the light, whereas dark-grown seedlings contamed 
etioplasts, with their characteristic prolamellar body and no significant organization of 
1 0 thylakoids. Analysis of stomatal structures on the underside of cotyledons from 
7-day-old seedlings indicates that stomatal development was not completed in the 
dark, because the stomatal opening was occluded in both lines. The majority of 
light-regulatory mutants analyzed to date displayed Ught-grown morphology in the 
dark without concomitant chloroplast or stomatal development. As in these mutants, 
1 5 therefore, the dwf4 mutation uncouples the developmental pathway of seedling 
morphology from that of hght-regulated cellular differentiation. 

An additional feature of many light-regulatory mutants is that 
photomorphogenesis in the dark is accompanied by expression of genes that normally 
are light induced (Chory et al. (1989b), supra , Chory et al. (1991), supra ; Deng et al. 
20 (1991), supra ; Wei and Deng (1992), supra ; Hou et al. (1993), supra ; Wei et al. 
(1994), supra ). To assess whether dwf4 is able to induce Hght-regulated transcripts in 
the dark, we compared the activity of a CAB promoter fused to the Escherichia coli 
gene uidA, encoding p-glucuronidase (GUS), in Ught- and dark-grown dwarf and 
wild-type plants. The CAB-uidA fiision in pOCA107-2 (Li et al. (1994) Genes Dev. 
25 8:339-349 ) was crossed into dwf4, and F2 dwarf and wild-type plants were grown in 
the dark or Ught for 12 days, followed by determination of GUS activity by 
fluorometry (Gallagher (1992). "Quantitation of GUS activity by fluorometry" in 
GUS Protocols, S.R. Gallagher, ed (New York: Academic Press), pp. 47-59). 

The results demonstrated that when grown in the Ught, both wild-type and 
3 0 dwf4 seedlings contained GUS activity, which was significantly reduced in both lines 
when grown in the dark. Moreover, dark-grown dwf4 seedUngs displayed no GUS 
activity above the background present in dark-grown wild-type plants. The absence of 
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light-induced gene expression in the dark is a distinguishing feature of certain cop and 
det mutants, such as cop2, cop3, and det3. Because we have shown that the defect in 
cell elongation ofdwf4 is specifically rescued by BRs, even in the presence of light, 
we conclude that this is not a Ught-regulatory mutant. That its phenotype is partially 
deetiolated or constitutively photomorphogenic is a secondary effect of its reduced 
stature and the growth conditions. 

Example 8: Abnormal Skotomorphogenesis as a Consequence of the Dwarf 
Growth Habitat 

When dwf4 is grown in the light, its morphology is similar to that of various 
cop and det mutants, with multiple short stems, short rounded leaves, loss of fertiUty 
due to reduced stamen length, and delayed development (Figure 6). Dark-grown dwf4 
seedlings possess short hypocotyls, open cotyledons, and developing leaves. 
Therefore, it is tempting to speculate that this mutant may be defective in the control 
of light-regulated processes. On the other hand, because a dark-flowering phenotype 
has been demonstrated for liquid-grown Arabidopsis (Araki and Komeda (1993), 
supra), and given that agar medium is mostly water, it is especially significant that it 
is the dwarf seedlings, whose apical meristems are very close to the agar surface, that 
display a Ught-grown phenotype in the dark. Furthermore, because wild-type 
seedlings grown along the surface of the agar reproduce the dark-flowering 
phenotype, it is possible that the apparent light-regulatory defect of dwarf seedlings is 
a dark-flowering response. This possibility is strengthened by the observation that 
wild-type seedhngs (ecotype Wassilewskija [Ws-2]) grown in the dark on horizontally 
oriented plates occasionally bend down and touch the agar surface, and these 
seedlings invariably produce leaves. 

In addition, of the eight DWF loci identified in this laboratory, only the 
shortest mutants displayed open cotyledons and leaf bud development; in the case of 
dwfl (Feldmann et al. (1989), supra), this aberrant skotomorphogenesis is confined to 
the most severely affected alleles, hi addition to the presence of a short hypocotyl and 
at least partially open cotyledons in the dark, copl (Deng and Quail (1992), supra), 
detl (Chory et al. (1989b), supra), and det3 (Cabrera y Poch et al. (1993) Plant J. 
4:671-682) have been shown to initiate leaf formation in the dark. In mutants such as 
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copl, cop8, cop9, cop 10, and cop 11, stomata undergo photomorphogenic maturation 
(Deng and Quail (1992), supra; Wei and Deng (1992), supra; Wei et al. (1994)), 
supra); of these, copl and cop9 as well as detl (Chory et al. (1989b), supra) also 
initiate differentiation of plastids into chloroplasts. dwf4 displayed, in addition to a 
5 light-grown dwarf phenotype, a dark- growth phenotype of short hypocotyls, open 
cotyledons, and developing leaves; however, in contrast with the light-regulatory 
defect seen with whole plants, the cellular differentiation phenotype was unaffected. 
In dark-grown dwarf seedlings, stomata did not complete their development, and 
differentiation of chloroplasts was not observed. The absence of a cellular 

10 light-regulatory phenotype in dwf4 is similar to that of a number of 

photomorphogenic mutants, such as detl, detS, cop2, cop3, and cop4 (Chory et al. 
(1991), supra; Cabrera y Poch et al. (1993), supra; Hou et al. (1993), supra). 

In view of the dark-flowering phenotype on agar and the absence of a 
light-regulatory defect in differentiating cells, we conclude that at least in the case of 

1 5 dwf4, aberrant skotomorphogenesis may be a consequence of a dwarf growth habit 
rather than dwarfism being part of a defect in the control of light-regulated processes. 
This effect may also explain the Ught-regulatory phenotype found in other mutants 
with severely reduced height, such as axr2 (Timpte et al. (1992), supra), and strong 
alleles of dwfl, both of which are also rescued by exogenous BRs (Szekeres et al. 

20 (1996), supra). 

Example 9: Feeding Experiments with BR Biosynthetic Intermediates 

In view of the results described above, we hypothesized that DWF4 mediates 
one or more of several steroid hydroxylation steps in the BR biosynthetic pathway. 

25 To test this, dwf4 was grown on all of the available biosynthetic intermediates in the 
BR biosynthetic pathways and examined to ascertain which intermediates could 
rescue the dwarf phenotype. In addition to the intermediates belonging to the early 
C-6 oxidation and late C-6 oxidation pathways (Choi et al. (1997), supra), 
22a-hydroxycampesterol (22-OHCR), 6a-hydroxycathasterone (6-OHCT) (Takatsuto 

30 et al. (1997) Chem. Res. (synop.) 11:418-419), and 6a-hydroxycastasterone 

(6-OHCS) (S. Takatsuto, T. Watanabe, T. Noguchi, and S. Fujioka, unpublished data) 
were synthesized and tested. 
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Germinated seedlings were transferred to media supplemented with one of the 
intermediates or BL to pinpoint the step catalyzed by DWF4. Cathasterone (CT; early 
C-6 oxidation pathway), 6-OHCT, 6-deoxocathasterone (6-deoxoCT; late C-6 
oxidation pathway), and 22-OHCR, and all of the downstream compounds belonging 
5 to each branch, rescued the light-grown dwf4 phenotype, whereas the known 
precursors failed to cause an elongation response. Rescued seedlings exhibited 
greatly elongated cotyledonary petioles and expanded cotyledons, moderately 
elongated hypocotyls, and leaves that were larger and not as curled compared with 
nonrescued dwarfs. In addition, the rescued seedlings were less green than the 

10 dwarfs. These experiments were conducted in liquid media. Feeding experiments 
performed in the dark yielded similar results. 

Dose-response tests on the putative substrates and products of DWF4 were 
also performed. dwf4 seedlings failed to respond to 6-oxocampestanol (6-oxoCN) 
even at high concentrations (3 x 10'^ M). However, on CT the overall morphology of 

15 dwf4 was essentially rescued to wild-type phenotype at 3 x 1 0'^ M and higher, 

whereas with 6-deoxoCT, rescue occurred with as little as 10"^ M and may have even 
been inhibitory at higher concentrations. Of particular interest is the more dramatic 
response of the epicotyls versus the smaller response of the hypocotyls to CT, This 
same phenomenon was true for seedlings treated with >10'^ M 6-deoxoCT, At 

20 concentrations >1 0'^ M, the seedlmgs displayed an inhibition in hypocotyl and root 
elongation as well as cotyledon and leaf expansion. 

In a dose-response experiment performed in the dark, the seedlings failed to 
respond to 6-oxoCN (10"^ to 3 x 10'^ M). A higher concentration of CT for 
dark-grown seedlings, compared with light- grown seedlings, 3 x 10'^ M (Figure 5B), 

25 was required to convert the hypocotyl to a length similar to that of the wild type. 

High concentrations of 6-deoxoCT caused dramatic elongation but were less effective 
at rescuing dwf4 hypocotyls to wild-type phenotype. 

To determine whether the results of the seedling feeding experiments could be 
applicable to soil-grown mature plants, 6-week-old dwf4 plants were treated with BR 

30 intermediates and BL. Concentrations of applied intermediates were adjusted 
empirically to optimize responses. Consistent with the results obtained from the 
seedling experiments, only 22a-hydroxylated compoimds can rescue the d-wf4 
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phenotype. The elongation response was only observed in the young tissues of the 
inflorescence, regardless of whether the BRs were appUed locally or sprayed over the 
entire plant. La contrast to the striking elongation of the peduncles and pedicels, 
fertility was not restored by BR treatment. The sterility in dwf4 is hypothesized to be 

5 mechanical, which means that the filaments are shorter than the carpels such that the 
pollen is shed onto the ovary walls rather than onto the stigmatic surface. In fact, if 
dwf4 plants are hand pollinated using dwf4 pollen, fertility increases. 

Pedicels displayed a more consistent response to exogenously appHed BRs 
than did intemodes, which led us to quantify the sensitivity of pedicels to these 

1 0 compounds. As shown in Figure 7, dwf4 pedicels were more sensitive to BR 

intermediates belonging to the late C-6 oxidation pathway, 6-deoxoCT (10"* M) and 
6-deoxoteasterone (6-deoxoTE; 10 ' M), compared with CT (10"' M) and teasterone 
(TE; 1 0-^ M) of the early C-6 oxidation pathway. The end product of the BR pathway, 
BL (10"^ M), possessed the highest bioactivity. This concentration induced 

15 approximately the same degree of response as its precursor compounds at 10"' M. 
Finally, application of 22-OHCR (10"' M) also resulted in a dramatic elongation 

response (Figure 7). 

Rescue oidwf4 by 22-hydroxylated steroids confirms that the missing step in 
dwf4 is hydroxylation at the C-22 position. In fact, we found that the chemically 
20 synthesized 22-OHCR was also effective in rescuing dwf4 (Figure 7) hydroxylation at 
C-22. These results indicate that there is no defect other than 22a- hydroxylation in 
dwf4 plants. 

In BR biosynthesis, Fujioka and Sakurai (1997b), supra have demonstrated 
that there are at least two branched biochemical pathways to the end product BL 

25 (Figure 1; Fujioka and Sakurai (1997a), supra, Fujioka and Sakurai (1997b), supra; 
Sakurai and Fujioka (1997), supra). Depending on the oxidation state of C-6, they are 
referred to as the early or late C-6 oxidation pathways. In the early pathway, the C-6 
is oxidized to a ketone at campestanol (CN), whereas in the late pathway it is oxidized 
at 6-deoxocastasterone (6-deoxoCS). Otherwise, the two pathways share equivalent 

30 reactions. Our results from the experiments with the available BR intermediates 
clearly demonstiate that dwf4 is defective in the 22a-hydroxylation steps in each of 
the pathways. AppUcation of all 22a-hydroxylated intermediates in these pathways. 
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such as CT and 6-deoxoCT, cause dramatic elongation of d-wf4 plants, but compounds 
not hydroxylated at C-22 had no effect. This resuU also suggests that DWF4 
recognizes at least two substrates: CN and 6-oxoCN. It seems reasonable to 
hypothesize that the same result will be found for CPD, a 23a-hydroxylase; that is, it 
will use 6-deoxoCT as well as CT as substrate. 

The rescue of dwf4 by 22-OHCR is an important observation. First, it 
confirms DWF4 as a 22a-hydroxylase. Second, this result suggests that 22-OHCR 
was metabolized to induce the same responses as other complementing BRs. This is 
not just a general effect because our unpubUshed data show that another dwarf mutant 
that we have identified in our screens, dwfS-l, is not rescued by this compound. 
Finally, these feeding experiments suggest that the metabolism of 22-OHCR may 
represent a new subpathway in the BR biosynthetic pathway. If this compound also 
exists in vivo and constitutes the first step in a separate subpathway, by analogy to the 
chemical structure, the C-6 hydroxylated BRs, for example, 6-OHCT, 
6-hydroxyteasterone, and so on, may be possible intermediates in this network. If so, 
the intermediates in this pathway may play a role as bridging molecules between the 
early and late C-6 oxidation pathways. Alternatively, it might be possible that 
22-OHCR merges into one of the two pathways to be metabolized. In this case, the 
late C-6 oxidation pathway is the best candidate; our unpubhshed data show that 
22-OHCR is more effective in the Ught in rescuing the dwf4 phenotype, which is true 
for all of the intermediates in the late C-6 oxidation pathway. 

Currently, biochemical feeding studies suggest that the two pathways merge to 
produce BL or CS (Yokota et al. (1991), Metabohsm and biosynthesis of 
brassinosteroids. In Brassinosteroids: Chemistry, Bioactivity, and Application, H.G. 
Cutler, T, Yokota, and G. Adam, eds (Washington, DC: American Chemical Society), 
pp. 86-96; Yokota, et al. (1997) Plant Physiol 115(suppl.):169; Figure 1). Several 
lines of evidence indicate that seemingly redundant pathways can be utilized to 
respond to environmental or developmental signals. First, the pathways could 
respond to specific signals. For instance, it is possible that various cues such as light, 
dark, or developmental signals play a role in regulating these subpathways. Our 
feeding experiments consistently showed that BRs in the late C-6 oxidation pathway 
are more effective at promoting cell elongation in hght-grown plants (dwf4 and wild 
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type; Figure 7) and that the BRs belonging to the early C-6 oxidation pathway are 
more active in dark-grown seedlings. Thus, it may be possible that the late C-6 
oxidation pathway operates in the Ught and that the early C-6 oxidation pathway 
functions primarily in the dark. Second, rather than a simple merger of branched 
pathways to BL as an end product, each intermediate may have nascent bioactivity. 
The in vivo ratio or composition of BRs at different oxidation states may resuh in 
different responses. Noticeably distinctive phenotypes for the various BR dwarfs, 
defective in different biosynthetic steps, support this idea. Third, the biosynthetic rate 
of each pathway toward production of the end product may differ. In this case, the 
biosynthetic rate could be modulated by controlUng the level of gene expression or 
the activity of participating enzymes. Certain signals, requiring different rates of BR 
biosynthesis, may induce one of the subpathways, which would then affect the 
concentration of the intermediates in one pathway relative to the other. 

Of the steps in BR biosynthesis in Madagascar periwinkle, the 
22a-hydroxylation reaction has been suggested to be the rate-limiting step (Fujioka et 
al. (1995a) BioscL Biotech. Biochem, 59:1543-1547). In periwinkle, the endogenous 
level of CT was as low as one-twenty thousandth of CR; however, CT was almost 500 
times more active than 6-oxoCN in the rice-lamina inchnation assay (Fujioka et al. 
(1995b) Biosci. Biotech. Biochem. 59:1973-1975). Based on these results, we propose 
that the step encoded by DWF4 serves as the rate-limiting reaction and that once past 
this step, the intermediates are easily converted to the end product. Although 
biochemical studies on DWF4 need to be performed to ascertain whether it mediates 
the rate-limiting step, DWF4 seems to be greatly downregulated compared with CPD, 
the next enzyme in the pathway; RT-PCR revealed that the DWF4 transcript is much 
less abundant than the CPD transcript. 

Example 11: 

A. Promoter and overexpression constructs 

Two promoter constructs were used for the DWF4-promoter::GUS (D4G) 
analysis. For promoter fusions, polymerase chain reaction (PCR) products spanning 
1.1 kb DNA upstream of the translation initiation site were amphfied using primers 
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D4XLINIT (5'-TAGGATCCAGCTAGTTTCTCTCTCTCTCT-3') (SEQ ID NO: 16) 
and a T7 primer (5'-TAATACGACTCACTATAGGG-3') (SEQ ID N0:17). For 
template for PGR, a DWF4 genomic clone subcloned into pBluescript SK- vector 
(Stratagene, La JoUa, CA) was used as described herein. The PGR products were 
5 restricted with SaR and BairiHl, and ligated into the same restriction site of a 

promoterless GUS vectorpBIlOl; this 1.1 kb promoter: :GUS construct was named 
pD4GL. For the pD4GS construct, pD4GL was digested with Hindlll, the small 
restriction fragment was removed, and the remaining vector with the partial promoter 
was self-ligated. The constructs were introduced into Agrobacterium strain GV3101 
10 through electroporation. 

For a DWF4 overexpression construct, PGR products were made by using 
D40VERFA (5'-GAATTCTAGAATGTTCGAAACAGAGCATCATA-3') (SEQ ID 
NO: 18) and D4R2 (5'-CCGAACATCTTTGAGTGCTT-3') (SEQ ID NO: 10) primers 
and Wassilewskija-2 (Ws-2) genomic DNA. The PGR products were cut with Xbal 
1 5 and HindUl, and inserted into the same restriction sites of genomic clone SCH25 
containing a 2.5 kb Hindm fragment of the DWF4 DNA corresponding to the 3' half 
of the gene. The resulting recombinant DNA clone pD4CDS, containing the whole 
coding sequence from the translation initiation site to 694 bp downstream of the stop 
codon, was cut with Xbal and fransferred to an overexpression vector pART27 
20 (Gleave (1992), Plant Molec. Bio. 20:1203-1207). The resultmg binary construct was 
named pOD4. This construct was introduced into Agrobacterium through 
elecfroporation. 

B. Spray transformation 

25 Since it has been shown that Agrobacterium-mediatcd transformation can 

work by seed infection (Feldmann and Marks (1987) Molec. Gen. Genet. 208:1-9) or 
by simply dipping the host plants into Agrobacterium culture, we decided to try 
spraying the Agrobacterium directly onto the plants. In addition to spraying , the 
"floral dip" method was used as described (Clough and Bent (1998), infra). About 20 

30 Wassilewskija-2 (Ws-2) wild-type and dwf4-4 seeds were sprinkled on 10 cm pots, 
and thinned to 5-6 plants per pot 10 days (wild type) and 20 days (dwf4-4) after 
germination. When the primary inflorescences of the wild type reached 3-4 cm in 
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height, they were decapitated to induce axilary bolts. dwf4-4 plants were used without 
decapitation. For the preparation of Agrobacterium, a single colony selected on 20 
Hg/ml kanamycin in Luria-Bertani (LB) medium (10 g bacto-tryptone, 5 g bacto-yeast 
extract, 10 g NaCl per liter, pH 7) was inoculated into 100 ml liquid LB media, and 
5 grown for 3 days. One ODgoo unit equivalent cells were used to inoculate 1 00 ml LB 
media. The overnight grown cells were collected by centrifiigation, and resuspended 
with transformation media as described in Clough and Bent (1998), infra (5% sucrose 
and 0.05% Silwet L-77, ODgoo = !)• The Agrobacterium suspension was sprayed onto 
plants on the third day after decapitation. To avoid physical contact with possibly 
1 0 hazardous Silwet vapor, protective glasses were used and the spraying was done in a 
fume hood. To test the efficiency of repeated spraying, plants were sprayed every 
third day (3x). Sprayed plants were grown to maturity and seeds harvested. For seed 
sterilization 0.07 g seeds were surface sterilized by treating for 2 min in 70% ethanol, 
15 min in bleach solution consisting of 5% Clorox and 1% SDS, followed by three 
15 rinses with sterile water. To plate the seeds 25 ml of sterile top agar (0.15% agar) was 
added to the steriUzed seeds and the seed mixture was poured onto Murashige and 
Skoog sohd plate (100 x 15 mm, Murashige and Skoog salts, 5% sucrose, 0.08% agar, 
pH 6) supplemented with kanamycin or hygromycin at 60 ng/ml and 40 ^g/ml, 
respectively. Twelve days after germination kanamycin resistant were transferred to 
20 single pots, and grown to maturity. T2 seeds were collected from individual 

transformants (Tl), and plated again on the selection media to determine segregation 
ratios for drug-resistant versus sensitive plants. Arabidopsis transformants were 
udrnQdArabidopsis Overexpressor of DWF4 {A0D4) when harboring an 
overexpression construct pOD4, and DWF4-promoter::GUS (D4G) for transformants 
25 containing a GUS fiision gene. Homozygosity for the transgene was determined when 
no sensitive T4 seedlings segregated from >500 T3 individuals. Morphometric 
analysis of A0D4 lines and GUS histochemical analysis of D4GL plants was 
performed using plants homozygous for the fransgene. 

For histochemical analysis of the D4GL plants, seeds were plated on M&S 
30 plates and grown in the dark and light. Seedlings were harvested at the designated 
dates and stained overnight using a substrate mixture (0.1 M NaP04, pH 7, 10 rtiM 
EDTA, 0.5 mM K3Fe(CN)6, 0.5 mM K4Fe(CN)„ 1 mM X-glucuronide, and 0.1% 
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Triton X-100). Seedlings cleared with 90% ethanol were rehydrated before taking 
pictures using a Stemi SVl 1 dissecting microscope (Zeiss, NY). 

Transgenic tobacco plants {T0D4) harboring the pOD4 constructs were 
produced in the plant tissue culture laboratory at the University of Arizona. Protocols 

5 for the regeneration of transgenic plants from lead discs of Nicotiana tabacum var 
Samsun will be provided on request. Fifteen independent transformants for both the 
control and 0D4 constructs were grown for seeds. Morphological analysis of the 
T0D4 lines was performed using T2 plants in the course of growth for 4 months in 
the green house (30°C). Methods for Arabidopsis growth and RNA gel blot analysis 

1 0 were previously described herein. Briefly, seeds of wild type and the two A0D4 lines 
were germinated on M&S agar media. 10 days after germination, 20 seedlings 
confirmed to be resistant to kanamycin were transferred to a single pot. Various 
morphological traits (Table 3) were measured. To determine the seed production, 
after 8 weeks from germination, plants were fiirther dried for two weeks at room 

15 temperature. Seeds were harvested from an individual plant and weighed. To 

measure the seed size, seeds were magnified 3 times under the dissecting microscope, 
the width and the length of five seeds from each plant were measured to the nearest 
tenth of mm. 

20 C. DWF4 transcription is localized to zones of cell division and elongation 

To locahze BR biosynthesis, RNA gel blot analysis with total RNA isolated 
from nine different tissues of three-week old plants was performed. The DWF4 
transcript was barely detectable in shoot tips, roots, dark-grown seedlings, callus and 
axilary buds, but the levels were below the detectable limit in the other tissues 

25 examined, including stems, siUques, pedicels, and rosette leaves. For finer localization 
of the expression, the expression of the GUS reporter gene controlled by the DWF4 
promoter was examined. 

Prior to performing DWF4-promoter.:GUS gene fiision analysis, a 1 . 1 kb 
fragment of DNA upsti-eam of the DWF4 translation start site was tested to ensure that 

30 it contained all of the necessary sequence elements for proper transcriptional control 
of DWF4. dwf4-4 plants were transformed with a 4.8 kb constinct consisting of a 1.1 
kb promoter region and 3.7 kb that contained the complete DWF4 coding sequence. 
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Vor Agrobacterium-medisLted transformation of Arabidopsis plants, a "spray 
transformation" protocol was employed rather than traditional methods. Spray 
transformation yielded a comparable number of transformants relative to the 
traditional "floral dip" (Clough and Bent (1998) Plant J. 16:735-743) or "vacuum 
infiltration" methods (Bechtold et al. (1998) Methods Mol. Biol. 82:259-266). 
Interestingly, repeated spraying resulted in an increased number of transformants. 
Transformants harboring the DWF4 genomic DNA displayed a wild type phenotype, 
suggesting that the promoter segment contained the necessary information for proper 
expression of the gene. 

For histochemical staining analyses of transgenic plants harboring the DWF4- 
promoter::GUS {D4G) recombinant gene, two different 1)-^^ constructs were made 
and tested. D4GL contained the 1.1 kb promoter fragment, whereas D4GS carried 
only a TATA-like promoter region (280 bp). GUS staining in 20 independent 
transformants containing D4GS was either not detected or inconsistent between 
transformants. However, the 20 transgenic plants containing D4GL displayed a 
consistent GUS staining pattern, suggesting that the 1.1 kb promoter is required for 
the proper transcriptional control ofDWF4. 

Analyses of GUS staining pattems in T2 plants homozygous for D4GL 
revealed that GUS activity was present in tissues with actively dividing or elongating 
cells. These include shoot apical meristems, leaf primordia, collet (the junction 
between hypocotyl and root), and root tips, including lateral root primordia, as shown 
in 6-day old Ught-grown seedlings. Interestingly, dark-grown seedlings displayed 
GUS activity in cotyledons whereas the staining was not detectable in the cotyledons 
of hght-grown seedlings, hi adult plants, GUS activity was detected in floral 
primordia, carpels, and the basal end of the filaments of unopened flowers, whereas 
GUS activity in sepals, petals, and mature pedicels was not detected. The shoot tips, 
bases of emerging branches, and primordia of axilary inflorescences were GUS 
positive, whereas elongated intemodes were negative. Embryos in the seeds of the 
fully elongated siUques were weakly positive for GUS staining, suggesting a role for 
BRs in embryo development. Leaf primordia, young leaves, expanding leaf margins, 
and the base of petioles displayed GUS activity, but old leaf blades were negative for 
GUS staining. The tissues positive for GUS staining confirmed the expression pattern 
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examined by northern analysis with the tissue-specific RNA. 

Since DWF4 is proposed to be a key enzyme in the BR biosynthetic pathway, 
DWF4 transcription could be regulated by an end-product feedback mechanism. To 
this end, D4GL was expressed in different genetic backgrounds including two BR 
deficient mutants, dwf7-l and dwf8-l, and a BR-enriched line, A0D4. GUS activity 
was increased in dwf7-l and dwf8-l but decreased in A0D4 lines. DWF7 is a C-5 
desaturase that acts in the sterol specific part of the pathway. D4GL activity in dwf7-l 
was found in the same tissues as wild type but. dwf8-l is defective in a BR 
biosynthetic step downstream of CPD. hi dwf8-l, the intensity of the D4GL activity 
was noticeably stronger as compared to wild type but the expression patterns were 
relatively diffuse. dwf8-l was also found to express GUS at nascent sites as compared 
to wild type. In wild type, D4GL expression in the cotyledons of Ught-grown 
seedlings was not detected, but dwf8-l displayed considerable D-^GZ activity in the 
cotyledons. Also in contrast to wild type, GUS activity was detected throughout the 
hypocotyls of dwf8-l light-grown seedlings, suggesting that D^GZ, transcription is 
upregulated in dwf8-l in a more general manner. Conversely, GUS activity was 
greatly reduced in AOD4-4 plants. Also, in A0D4 plants GUS activity in the root tip 
and collet was completely eliminated, whereas the shoot tip retained residual activity, 
suggesting that increased levels of BRs in AOD4-4 may have resulted in lower GUS 
activity. The down-regulation of GUS activity was similarly found if D4GL plants 
were exogenously suppHed with 10'^ M 24-epibrassinoUde (epi-BL). Seedlings 
treated with epi-BL displayed greatly reduced GUS activity in tissues normally 
stained in untreated control plants, suggesting that exogenously appUed epi-BL 
effectively down-regulates D4GL activity. However, hypocotyls of D4GL plants 
supplemented with lO'"^ M GA3, while longer than controls, did not display an 
increase in GUS staining in shoots and roots. This suggests that GA3 or GAs-induced 
elongation did not affect D4GL transcription in these tissues. 

D. DWF4 overexpression results in elongated hypocotyls in Arabidopsis and 
tobacco seedlings 

A DWF4 overexpression construct (pOD4) was made by placing the DWF4 
genomic DNA under the control of the CaMV 35S promoter. RNA gel blot analysis, 
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with total RNA isolated jfrom the transgenic lines containing the overexpression 
constract, showed that DWF4 transcripts were greatly increased in both Arabidopsis 
and tobacco, whereas the level was not readily detectable in either wild type or in 
dwf4'l plants. Similar to increased mRNA transcripts, the 80 independent AOD4 
transgenic plants had longer hypocotyls and inflorescences. 

To compare the phenotypic effects resulting from the endogenous and 
exogenous addition of BRs, the length of roots and hypocotyls of 16 seedUngs of 
dwf4, wild-type controls, wild-type plants supplemented with 10"^ M epi-BL, and two 
independent A0D4 lines, grown for 12 days in the light or dark was measured. As 
described herein, dwf4-l displayed greatly reduced hypocotyl length both in the light 
and dark as compared to wild type. Wild-type roots are shortened when grown in the 
dark, but dwf4-l root length was not significantly reduced in the dark compared with 
the reduction in hypocotyl length. When epi-BL is added, hght-grown wild type 
seedlings developed elongated hypocotyls, whereas roots were shorter than untreated 
control plants. These characteristic responses of wild-type plants to epi-BL treatment 
were similar in two independent AOD4 lines. The hypocotyl length of hght-grown 
A0D4 seedlings was comparable to that of seedlings treated exogenously with epi- 
BL. However, dark-grown hypocotyls showed a dramatic increase in length as 
compared to controls with or without epi-BL. Inhibition of root growth was also 
obvious in the A0D4 lines. Furthermore, the increased hypocotyl length and reduced 
root length were consistently observed in 15 independent transformants of tobacco 
{T0D4) harboring a pOD4 construct. This result suggests that the Arabidopsis DWF4 
enzyme also catalyzes BR biosynthesis in tobacco. 

E. DWF4 overexpression results in increased plant height, bigger leaves, and 
increased seed production 

As shown below in Table 3, the effects of DWF4 overexpression on plant 
growth were monitored during the course of development. The number of rosette 
leaves at bolting was not significantly different between wild-type and A0D4 plants 
(Table 3). The inflorescence height of wild type and two independent A0D4 lines 
were comparable 20 days after germination (DAG). Later, the A0D4 lines outgrow 
wild type. Surprisingly, A0D4 lines continue to grow beyond 35 DAG at the time 
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wild-type plants ceased elongation. At maturity, the height of A0D4 lines was 135% 
{AOD4-65) and 142% {AOD4'73) that of wild type, respectively. Similarly, T0D4 
also displayed a 14% increase in plant height as compared to the control. 
Interestingly, the increased inflorescence length in A0D4 plants seemed to be at the 
cost of stem stiffiiess. During development, A0D4 plants tend to fall over eariier than 
the Ws-2 wild type. In addition to plant height, comparison of rosette leaf size 
between wild type and A0D4 indicates that leaves, both rosette and cauline, are 
larger, especially in adult plants. T0D4 plants also possessed leaves that were larger, 
and had longer petioles relative to the control. Furthermore, additional secondary 
branches were found both in Arabidopsis and tobacco overexpression lines. lnA0D4 
plants, this additional branching was associated with >2 times increased number of 
siliques per plant, leading to a 33 and 59% increase in seed production (Table 3). The 
increased seed production in the A0D4 lines was mainly due to the increased number 
of seeds per plant than increase in the seed size, because the size was not significantly 
increased (Table 3). In addition to the increased number of seeds, the length of siUque 
as well as the length of an intemode between the first silique in a main inflorescence 
and the base of plant was increased (Table 3). 

Figure 8 shows that stem growth is increased more than 20% compared to 
wild type in DWF4 overexpression lines and Figure 9 shows that seed production is 
increased significantly over wild type in the DWF4-overexpressed lines. Figure 12 
depicts hypocotyl length and root length in light and dark. Further, the height of 
A0D4 lines was greater than wild type over the days examined. In addition, although 
wild type plants ceased growth around five weeks after germination, A0D4 plants 
continued to grow up to seven weeks. 
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Thus, the DWF4 locus is defined by at least four mutant alleles. One of these 
is the result of a T-DNA insertion. Plant DNA flanking the insertion site was cloned 
and used as a probe to isolate the entire DWF4 gene. Sequence analysis revealed that 
DWF4 encodes a cytochrome P450 monooxygenase with 43% identity to the putative 
5 Arabidopsis steroid hydroxylating enzyme CONSTITUTIVE 

PHOTOMORPHOGENESIS AND DWARFISM. Sequence analysis of two other 
mutant alleles revealed deletions or a premature stop codon, confirming that DWF4 
had been cloned. This sequence similarity suggests that DWF4 functions in specific 
hydroxylation steps during BR biosynthesis. The dwarf phenotype can be rescued 

10 with exogenously supplied brassinoUde. dwf4 mutants display features of 

Ught-regulatory mutants, but the dwarfed phenotype is entirely and specifically 
brassinosteroid dependent; no other hormone can rescue dwf4 to a wild-type 
phenotype. Feeding studies utilizing BR intermediates showed that only 
22a-hydroxylated BRs rescued the dwf4 phenotype, confirming that DWF4 acts as a 

15 22a-hydroxylase. In adult plants, strong GUS staining (indicative of dwf4 expression) 
was found in the primordia of axilary inflorescences and secondary branches, and in 
young developing flowers. GUS expressing tissues correspond to the tissues sensitive 
to exogenously applied BRs leading to the hypothesis that these tissues are putative 
brassinolide biosynthetic sites. The inflorescence height of DWF4 overexpressing 

20 lines increased >35% in Arabidopsis (AOD4) and 14% in tobacco {T0D4) as 

compared to control plants at maturity. The total number of branches and siliques 
increased >2-fold in A0D4 plants, leading up to a 59% increase in seed production. 
The phenotypes of dwf4^ DWF4, and A0D4 plants suggest that the degree of DWF4 
transcription is associated with the degree of BR effects. In sum, it appears be 

25 possible to engineer agricultural plants with increased biomass and seed yield. 
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Claims: 

What is claimed is: 

1 . An isolated dwf4 polynucleotide comprising an open reading frame encoding a 
5 polypeptide comprising (i) a sequence having greater than 43% identity to the amino 

acid sequence of SEQUENCE ID N0:2; (ii) a sequence comprising at least about 10 
contiguous amino acids that have greater than 43% identity to 10 contiguous amino 
acids of SEQUENCE ID N0:2, or a complement or reverse complement of said 
polynucleotide. 

10 

2. The isolated dwf4 polynucleotide of claim 1 wherein the polynucleotide has at 
least 70% identity to the DWF4 polypeptide-coding region of SEQ ID N0:1, 
complements and reverse complements thereof. 

15 3. The isolated dwf4 polynucleotide of claim 1 comprising the nucleotide 
sequence of SEQ ID N0:1, complements and reverse complements thereof 

4. The polynucleotide of claim 1 comprising at least 30 consecutive nucleotides 
of SEQ ID N0:1. 

20 

5. An isolated dwf4 polynucleotide comprising (i) a sequence having at least 
50% identity to SEQ ID NO:l, complements and reverse complements thereof or (ii) 
a sequence comprising at least about 15 contiguous nucleotides that has at least 50% 
identity to SEQ ID N0:1, complements and reverse complements thereof 

25 

6. The isolated dwf4 polynucleotide of claim 5 having at least 50% identity to the 
DWF4 polypeptide-coding region of SEQ ID N0:1, complements and reverse 
complements thereof 

30 7. The isolated dwf4 polynucleotide of claun 5, comprising the nucleotide 
sequence of SEQ ID N0:1, complements and reverse complements thereof 
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8. The polynucleotide of claim 5 comprising at least 30 consecutive nucleotides 
ofSEQIDNO:!. 

9. The isolated polynucleotide of claim 5, wherein the polynucleotide is genomic 
5 DNA. 

1 0. The isolated polynucleotide of claim 5, wherein the polynucleotide includes 
introns. 

10 11. A recombinant vector comprising (i) the polynucleotide of claim 1 ; and (ii) 
control elements operably linked to said polynucleotide whereby a coding sequence 
within said polynucleotide can be transcribed and translated in a host cell. 

12. A recombinant vector comprising (i) the polynucleotide of claim 5; and (ii) 

1 5 control elements operably linked to said polynucleotide whereby a coding sequence 
within said polynucleotide can be transcribed and translated in a host cell. 

13. A host cell comprising the recombinant vector of claim 1 1 . 
20 14. A host cell comprising the recombinant vector of claim 12. 

15. A method of modulating a DWF4 polypeptide comprising the following steps: 

(a) providing a host cell according to claim 14; and 

(b) culturing said host cell under conditions whereby the dwf4 polynucleotide 
25 is transcribed. 

16. The method of claim 1 5 , wherein the dwf4 polynucleotide is overexpressed. 

1 7 . The method of claim 1 5 , wherein expression of dwf4 is inhibited. 

30 

18. A transgenic plant comprising the recombinant vector of claim 1 1 . 
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19. A transgenic plant comprising the recombinant vector of claim 12. 

20. The isolated polynucleotide of claim 5, wherein the polynucleotide includes a 
dwf4 control element comprising a polynucleotide selected from the group consisting 

5 of (i) a sequence having at least 50% identity to nucleotides 1 to 3202 of SEQ ID 
N0:1; (ii) a fragment of (i) which includes a dwf4 control element; and (iii) 
complements and reverse complements of (i) or (ii). 

21 . The isolated polynucleotide of claim 5, wherein the polynucleotide includes a 
10 dwf4 control element comprising a polynucleotide selected from the group consisting 

of (i) a sequence having at least 50% identity to nucleotides 6111 to 6468 
corresponding to the 3' UTR of SEQ ID NO: 1 ; (ii) a fragment of (i) which includes a 
dwf4 y UTR; and (iii) complements and reverse complements of (i) or (ii). 

15 22. The isolated polynucleotide of claim 5, where in the polynucleotide includes a 
dwf4 control element comprising a polynucleotide selected from the group consisting 
of (i) a sequence having at least 50% identity to the sequences corresponding to the 
introns of SEQ ID N0:1; (ii) a fragment of (i) which includes a dwf4 intro; and (iii) 
complements and reverse complements of (i) and (ii). 

20 

23. The isolated polynucleotide of claim 22, wherein the introns are selected from 
the group consisting of nucleotides 3424 to 3503 of SEQ ID N0:1; nucleotides 3829 
to 3913 of SEQ ID N0:1; nucleotides 4067 to 4164 of SEQ ID N0:1; nucleotides 
4480 to 4531 of SEQ ID NO:l; nucleotides 4725 to 4815 of SEQ ID NO:l; 

25 nucleotides 4895 to 5000 of SEQ ID NO: 1 ; and nucleotides 5 1 1 1 to 5864 of SEQ ID 
NO:L 

24. A recombinant vector comprising: 

(a) the isolated polynucleotide which includes a dwf4 control element of claim 

30 20; and 

(b) a nucleic acid molecule comprising a coding sequence. 
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25 . A host cell transformed with the recombinant vector of claim 24. 

26. A method of producing a recombinant polypeptide comprising the following 
steps: 

5 (a) providing a host cell according to claim 25; and 

(b) culturing said host cell under conditions whereby the recombinant 
polypeptide encoded by the coding sequence present in said recombinant vector is 
expressed. 

10 27. A method of producing a transgenic plant comprising the steps of: 

(a) introducing the polynucleotide of claim 5 into a plant cell to produce a 
transformed plant cell; and 

(b) producing a transgenic plant from the transformed plant cell. 

15 28. A method for producing a transgenic plant having an altered phenotype 
relative to the wild-type plant comprising the following steps: 

introducing at least one polynucleotide of claim 5 into a plant cell; and 
producing a transgenic plant from the plant cell, said transgenic plant having 
an altered phenotype relative to the wild-type plant. 

20 

29. The method of claim 26, wherein the phenotype is selected from the group 
consisting of altered cell length, altered periods of flowering, altered branching, 
altered seed production, altered leaf size, elongated hypocotyls, ahered plant height, 
altered heme-thiolate enzyme activity, altered monooxygenase activity, altered 22a- 
25 hydroxylase activity, regulation of brassinosteriods, regulation of gibbereUic acid, 
regulation of cytokinins, regulation of auxins, altered resistance to plant pathogens, 
ahered growth at low temperatures, altered growth in dark conditions, and ahered 
sterol composition. 

30 30. The method of claim 28, wherein the phenotype is increased seed production. 
3 1 . The method of claim 28, wherein the phenotype is increased plant height. 
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32. The method of claim 28, wherein the phenotype is increased leaf size. 

33. The method of claim 28, wherein the phenotype is altered 22a-hydroxylase 
activity. 

5 

34. The method of claim 28, wherein the polynucleotide is operably hnked to a 
promoter selected from the group consisting of a tissue-specific promoter, an 
inducible promoter or a constitutive promoter. 

10 35. The method of claim 28, wherein the polynucleotide is overexpressed. 

36. The method of claim 28, wherein the polynucleotide inhibits expression of 
dwf4, 

15 37. The method of claim 28, wherein at least first and second polynucleotides are 
introduced into the plant cell, said first and second polynucleotides operably linked to 
at least first and second tissue-specific promoters, wherein said first polynucleotide is 
overexpressed and said second polynucleotide inhibits expression of dwf4, 

20 38. A method for altering the biochemical activity of a cell comprising the 
following steps: 

introducing at least one polynucleotide of claim 5 into the cell; and 
culturing the cell under conditions such that the biochemical activity of the 
cell is altered. 

25 39. The method of claim 38, wherein the biochemical activity is selected from the 
group consisting altered heme-thiolate enzyme activity, altered monooxygenase 
activity, altered 22a-hydroxylase activity, regulation of gibberellic acid, regulation of 
cytokinins, regulation of auxins, and altered sterol composition. 

30 40. The method of claim 39, wherein the cell is cultured ex vivo. 

41 . The method of claim 39, wherein the dwf4 polynucleotide is provided to the 
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cell in vivo. 



42. The method of claim 39, wherein more than one dwf4 polynucleotides are 
provided to the cell. 

5 

43. A method of transforming a host cell comprising the step of introducing into 
said cell the recombinant vector of claim 14. 

44. The method of claim 43, wherein the host cell is cultured ex vivo. 

10 

45. The method of claim 43, wherein the dwf4 polynucleotide is provided in vivo. 



46. A method for regulating the cell cycle of a plant cell comprising the following 
steps: 

1 5 providing a dwf4 polynucleotide to a plant cell; and 

expressing the dwf4 polynucleotide to provide a DWF4 polypeptide, wherein 
the DWF4 polypeptide is provided in amounts such that cell cycling is regulated. 

47. The method of claim 46, wherein the plant cell is provided in vitro and is 
20 cultured under conditions suitable for providing the DWF4 polypeptide. 

48. The method of claim 46, wherein the dwf4 polynucleotide is provided in vivo. 

49. A method of according to claim 25, wherein the DWF4 polypeptide is 
25 provided in amoimts such that a plant is regenerated. 

50. The method of claim 49, wherein the plant cell cultured ex vivo. 

5 1 . The method of claim 49, wherein the dwf4 polynucleotide is provided in vivo. 

30 

52. An isolated DWF4 polypeptide comprising (i) a sequence having greater than 
43% identity to SEQ ID N0:2 or (ii) fragments of (i) that confer a DWF4 phenotype 
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when expressed in a host organism. 

53. The isolated DWF4 polypeptide of claim 52 comprising the amino acid 
sequence of SEQ ID N0:2. 

5 

54. An isolated polynucleotide comprising a polynucleotide according to claim 5 
operably linked to a nucleic acid molecule encoding a heterologous polypeptide. 

55. The polynucleotide of claim 54 wherein the heterologous polypeptide is a 
1 0 cytochrome P450 polypeptide. 

56. A chimeric polypeptide comprising a DWF4 polypeptide according to claim 
52 and a heterologous polypeptide. 

1 5 57. The chimeric polypeptide of claim 56 wherein the heterologous polypeptide is 
a cytochrome P450 polypeptide. 
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Abstract of the Disclosure 

DWF4 POLYNUCLEOTIDES, POLYPEPTIDES AND USES THEREOF 

5 The present invention relates to novel, polynucleotides isolated from dwarf 

plants. The dwf4 polynucleotides that encode all, or a portion of, a DWF4 
polypeptide, a cytochrome P450 enzyme that mediates multiple steps in synthesis of 
brassinosteroids. The present invention also relates to isolated polynucleotides that 
encode regulatory regions of dwf4. Uses of the dwf4 polypeptides and 
10 polynucleotides are also disclosed. 
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-CYP120 (Cyanobacteria) 
.CYP26 (Zebrafish, P450RAi) 

-CYP90A (Arabidopsis, CPD) 
-CYP90B (Arabidopsis, DWF4^ 
- CYPeS (Tomato, Dwarf) 
-CYP88 (Maize, D3) 
-CYP3A3X (Human, P450HLP) 
-CYP71A1 (Avocado) — | 
.CYP83 (Arabidopsis) 
-CYP75A1 (Petunia) 
-CYP76A1 (Eggptant) 
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Increased seed production due to DWF4 overexpression 
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1 ATGTGGGTATTATATTGTTGGGTTCGGTTTGAGCTACAATATAAATTTCGTGTTTCTGGT 60 
61 TATTCTGTTCACATGATTTGAGTTTGGTTCTCAATTTGGATTCCAAGATAATTAAATA^ 120 
121 AAAATTCATTTAAAATATTTACAAGTAATTAATTATCTTTACATTGTATTGTT 180 
181 AATATCTATCTITGGTATATGAGAAAATATGGAGTTTGGAATTTATAATAATAAAGGi^ 240 
241 TAATCGATTCCATTTGGTTGGATTACACAGTTAAGTTTTTGTGTTTCTTTTG 300 
301 ATATGAGTAAATCAAAAAGAGTATTGATTGAAGTGTAAACATATTTCGTTATGACCCCCA 360 
361 AAAAAAAAAAAAAAACAAACAAACAAACCCCCCCCCCGATATAGTTTTTGGTTCTGGATT 420 
421 AGGTTTATTTGATCATAATTACATGCATCATTTCTTTGATTACTATGAAGATTTTCT^^ 480 
481 CAATTAAAATTTCGAATTCATATCTCITGATTATTAAATTAAATACGAGTGTG^ 540 
541 GTTTATCGATCACTCCAATCATGATTATGATTCTTGTGCTAATCCAGCAAATTATTAACA 600 
601 AGAGTATTGAGAAAAAACCGAAAATAAGAAAAGGGAAAGAGTAGTGACCCATGGAGTATG 660 
661 TGAATAATTATCAAAGAGAATAAGAGATGACAACCAAAAGGTTGTGGAATAATGGTCCCT 720 
721 GCCAGCTTTCTCTCACAATCAATATCGACCCTATTTGGATTTTCTGGATATTCGTTAAAA 780 
781 TTTGCGATAACGATTGTGAAAAATATTTTATTTGTTAGCTGATCTCAATATTATGTTCCA 840 
841 GGTATTTGCATAATCTTCTGTTTAAAGCATATTTTGTCTTTCTTTTTGTl^ 900 
901 TAACTATATATTATCGCGGATATATGATAACAATGATATATCACAAAACAATTGTCTGGG 960 
961 ACCATTTTGAATAAACITITrCTCAAACATTACGGGACACTGGACrCGACCC^ 1020 

FIG lOA 



1021 CGATTTTACAGCGTCACTAGTTGAGATTACTAGCATAAAGCATAAAGGACCCGTTC 1080 
1081 TATTTATACAAAGTTACAAACTGAATATAGCTTGAAATCCTTTAGAAAAT^^ 1140 
1141 CCGGTTGTTATGTAAATATAGATTTAGTGGTAAACAAATATGTTAATCAATTAGTG 1200 
1201 ACATATACATAATTCCTTACAGAAAAAACAAACTTAAGAGAAGTTAACATATCCATA 1260 
1261 GGGTATGCTATACCTTTCACGTATGCTATACTAGAGACTAAAGAATAGTTATGTGATGTC 1320 
1321 GATAAATGAAATTCACACGCGTGGTAATAATTATGGGACCGTATGTTACGATCACTGCAA 1380 
1381 ATATCATTCTTGGTTGGTCAACAATAAAAACAAAAACAAGAAAAAAA 1440 
1441 CTTGGATTCCATTCAATGATCTAAAATGCATAGATCTTTTGGGT^^ 1500 
1501 TCTACAAGCGTGTAACCATCTGCAACTATTAAATTGCTTTCTTTAATGCATC^ 1560 
1561 ATTTATTGTTAGTTGGAATTTAATAAGAGCGAACTTGTAACATTACAAT^ 1620 
1621 ATACTAGTATGTGATTATTCCAAATACATACTTTGGATGTTTAAAC^ 1680 
1681 TCCTACGGTATAAATATTAATCATCGAGGTAAAAAAAGTTTTGTCT^ 1740 
1741 ATGAAGGATAAACCTAATGACirrAATTI^ 1800 
1801 AATTACCGTATGTTTTTGTTGCCATAATGACAGCCTCTACAACTGTG^^ 1860 
1861 TCTGCAAATATTAAATTAGGAATTCAATGCTACTATCAATAGAAGAAACAGCTGAGTAT^ 1920 
1921 ACATTTTAATTTAAAGACAAAATTTTTGAAA^ 1980 
1981 AATATGATGCCTATAATGTATTTCCTATGTTCTTAAAATATT^^ 2040 
2041 TAAATACATTATGAACCAATAATAGTTGGTGAATTCAAATATCTCCATTAATAT^^ 2100 

FIG lOB 



2101 AAATCTACAAATTATTAATATTTAGTCAATAACAATGCATAGAAAGTTCCAAAAAAA^ 2160 
2161 TTGTTAACAGAAACTTCCAAATTTTTTT^^ 

2221 CTATTTTGTTGTGGAATGGAAGTAGTAATATACATTAAGCAAATT^ 2280 
2281 AGCCTATACGCGCTCAAAGTATGTTATCTAGTAGGTGTAATTAATAATGCATGGTGCGAT 2340 
2341 TCAGAATTGGGACAACAATGAAAACGGAATTAAAATATTAACTTTAAAATA^ 2400 
2401 TTGAGTAAATGTGTTTTCTGACTATTGAGGGGCAAAAAAAAGACAATGCCAA^ 2460 
2461 GGGTTTGACTGTCCAGTTCGGTAATAATCTAATAACTCTGTCTTTGACCGCACGCT^ 2520 
2521 TAGGGGTCCTTCTGACATTTTCACTGTTCTACCCCTACTCGTGAGCCCACCCTTI^ 2580 
2581 TATCCTAAGGGTAATTTTGGAAATCCCAATTTAAACCGAI^^ 2640 
2641 TGGGATTCTGCTGGAGCATTTATCAAAAATTATTAGCACGAATGGGTTTATTAAT^ 2700 
2701 AACTCACAACTTGATCAGATAAAATTTCATAAACACTTTTACG 2760 
2761 ATCTAATGACTTTTTTTTTTCTACCACGGTGGATGA^ 2820 
2821 ACAATTGATTATAGATATATCCATTAATCCATGATATTTATGATATAAATAGCTGTT 2880 
2881 CTATTTCAGCATCGCAGCTTTCTGCAACTTTTGTTT^ 2940 
2941 AAGTATTAAAAGGAGCATAACGAGGCAACAAAAGTAATGAACACGGAGAAACAAAAGCCA 3000 
3001 TGAAGCTCATTGGTTAGTTTAAGCITAATAAGAAGATT^ 
3061 ATAACAATTATATTTTCTGACTTCTTTAAAACCCCCTC^^ 

3121 TCAGTAGAAGTCCGATTCCCAATCTTAAAGACAAAGCCATTAGAAAGAGAAAGTGAGTGA 3 1 80 

FIG IOC 



3181 GAGAGAGAGAGAAACTAGCTCCATGTT CGAAACAGACKDATCATACTCTCTTACCTCTTCT 3240 

^ _____ 

3241 TClTCTCCCA TCGCTTTTGTCTCTTCTTCTCITCrrTGATTCrC ^ 3300 
' T 

3301 AAAAACCAGATTCAATCTACCTCCGGGTAAATCCGGTIGGCCATTTCTrGGTGAA^ 3360 



3361 CGGTTATCTTAAACCGTACACCGCCACAACACTCGGTGACTTCATGCAACAACATGTCTC 3420 
i - i 

3421 CAAGTAAACAACAACATCnrCCAAAAACTCAAAAAAATAAATCCTCTOTTTTTGAA^ 3480 

cm 

3481 GACTAATGTTGTTTATTTTACAGGTATGGTAAGATATATAGATCGAACTTGTTTGGAGAA 3540 

t — 

3541 CCAACGATCG TATCAGCTGATGCTGGACTTAATAGATTCATATTACAAAACGAAGGAAGG 3600 

' .... 1 

3601 CTCT TTGAATGTAGTTATC CTAGAAGTATAGGTGGGATTCTTGGGAAATGGTCGATGCTT 3660 

'7 

3661 G TTCTTGTTGGTGACATGCATAGAGATATGAGAAGTATCTCGCTTAACTTCTTAAGTCAC 3720 

■ ' — I 

3721 (X:ACGTCTTAGAACrATTCTACTTAAAGATGTTGAGAGACATACr iTGTT^ 

3781 TCITGGCAAC AAAACTCTATTTTCTCTGCTCAAGAC GAGGCCAAAAAG GT^^ 
I i 

3841 ATCTTTTATTTTGCTAAATTTTTTTGTTTATGAATC^ 3900 



3901 TTTAATTGAACAG TTTACGTTTAATCTAATGGCGAAGCATATAATGAGTATGGATCCTGG 3960 

r — ~ 

3961 AGAAGAAGAAACAGAGCAATTAAAGAAAGAGTATGTAACITrCATGAAAGGAGTTGTCTC 4020 

C . ^ . — — 

4021 TGCTCCrCTAAATCTACCAGGAACTGCTTATCATAAAGCT(nTCAGG TACATTTAT^^ 

< ' ' ' ' 

4081 TTTTGCTGTAAAGTCACAAACrCTCATTATAGGTTTTTAATTTTAT^ 4140 



4141 AAAATATCTAAAATGGTTGTGTAGTCACGAGCAACGATATT GAAGTTCATTGAG AGGAAA 4200 

I 

4201 ATGGAAGAGAGAAAATTGGATATCAAGGAAGAAG ATCAAGAAGAAGAAGAAGTGAAAACA 4260 



FTGIOD 



426 1 GAGGATGAAGCAGAGATGAGTAAGAGTGATCATGTTAG GAAACAAAGAACAGACGATGAT 4320 

f , , , , in 

4321 CTTTTG GGATGGGTTTTGAAACAT TCGAATTrATCGACGGAGCAAATTCTCGATCTCATT 4380 
4381 CTTAGTTTGTTATTTGCCGGACATGAGAcTTCITCTGTAGCCATTGCTCTCGCTATCT^ 4440 
4441 TTCTTGCAAG CTTGCCCTAAAGCCGTTGAAGAGCTTAGG GTAAGATAATTATAACAGCAC 4500 
4501 AAGTTAATTACrACCAAATTGTTACGTATTATATAAGTTATTATAGAATTATTCTATTAG 4560 
4561 AATATACGATGAAAAAAGTATGTATATTTAATTGTCACTAATTITATGTTTATTGATTTA 4620 



4621 TAcmTTGAAG GAAGAGCATCTTGAGATCgCGAGGGCCA^ 4680 
4681 AATTAAA TTGGGAT GATTACAAGAA AATGGACTTTACTCAATGTG TATGTTACTATCATT 4740 

r- t- 

4741 CTCATTATTTATTCTATGTTCATATGATTTATGATGAAACCAAAATTATTGATTTT^^ 4800 



4801 TTGGTGTGTGTGAAGGTTATAAATGAAACTCTTCGATTGGGAAATGTAGTTAGGTTTTTG 4860 

r _ .n 

4861 CATCGCAAAGCACTCAAAGATGTTCGGTACAAAGG TAAAACTTTACGTACAAAATTT^ 
~ ! 

4921 AATAATGAAATCCGGAATATTGAAATCrrATTGGATGAAAAATATTAAAATAATTrACAT 4980 
4981 TTCTTAATGTTGGAAAAAAGGATACGATATCCXn'AGTGGGTGGAAAGTGTTACCGGTGAT 5040 



5041 CTCAGCCGTACATTTGGATAATTCTCGTTATGACCAAC CTAATCTCTTTAATCC TTGGAG 5100 

5101 ATGGCAACAG GTAAATAAAAAGTTTCTCTCGTTAACTATCGAAAATTAGTGTATAGTTTT 5160 
— I 

5161 TTCATCTATTGCATGAATAGATACGTCCTACGTGATTTACCTATCTATAGATACTATACG 5220 
5221 AGAACTATTAATCTGGCAAAAACTTITrATTATTATrATCm 5280 
5281 CGTCATGGATCATTGATCACATGAAAGCATATAAATTAAAAATAAGAGAGAGAAAQAGAC 5340 

FIG lOE 



5341 GTGTTGGTGTAAGTGTACGTGAAGACAATTAATTAGTAGGATGGTATGTCTTTAATGACG 5400 
5401 TAGGAGCTGCCTAAATATTCTTATAATCGTGACCGTTGATTTATTATTAGTCAC^ 5460 
5461 GATACAATTTAAGATTTGACGGACGATGGTACCACGGCTTTGACGGATC^^ 5520 
5521 ATGACTTGTACGTGCGTTAGATTCTGCCACGTTGACTGGTTTTAATAOT 5580 
5581 CTCTATTAATTATAACAACTATCAAATCGGCGAATTAGAGAAATATACTATA^^^ 5640 
5641 TTATGATTATTATGAGATAATACTTTATGAAATAAGATAATAATGGTAGTCATGATGTTA 5700 
5701 TAGTGAGTGGGGAAGGTAAGAGGTGGTGAGAGATGATTAATGACCCCACGTGGTGTGGTG 5760 
5761 CCAACAAGCACGTGTTCTTCTTCCTTTTTTCTTCCCAAC^ 5820 
5821 GTGATTTATAAAATCGGTTTGTCGTTTTTT^ 



5881 CATCGTCAGGAAGTGGTAGTTTTTCGACGTGGGGAAACAACTACATGCCGT^ 5940 
^ . 

5941 GGCCAAGGCTATGTGCTGGTTCAGAGCTAGCCAAGTTAGAAATGGCAGTGTTT^^ 6000 



6001 ATCTAGTTCITAAATTCAATTGGGAATTAGCAGAAGATGATCAACCAT^ 6 060 
( ^ ^ ' 

6061 TTGTTGATTTTCCTAACGGTTTG CCTATTAGGGTTTCTCGTA^ 6120 

r 

6121 AGATGAAAGTATITITATTCTCTTCTTTTI^^ 

6181 CCAATGATATATAAAAATTTGGATAAATAATATTATTGGATATTCGTT^^ 6240 

6241 TTTGAGAAAAGGGTTTCGACTTTCGAAAGTGGACGATGTATATAGATTGGGAG^ 6300 

6301 GAGTCTTTGGACATTTGTATTGGATGTTGTTGATTATTAGTGTCGACACT 6360 

6361 AAATGGGCTTTCTATAAGGCCCAATTATATTACGATTATAACAAAGTGACAACT^ 6420 

FIG lOF 



6421 TCGTTTTTGATCCGAAGCAATAACAAATTGTCAAATACCAAACACAAGAAT^^ 6480 
6481 ACTCGTGTGTGTCTAGTGGGAAATCATTGGGCTGGAGACTGAACATCAGAACACAAGAAA 6540 
6541 CCTGTCAATTATGGATACACCTCCTATGACGGTTTCCAAACTTTATCTTGATTC^ 6600 
6601 TGTTACATTGACACAAAGAGTTAGGTGTCAAAAGGACTAAATGAATAACAATAGCTCTC^ 6660 
6661 GGATAAGAAGGTTCATAAAATGGTTTCTTTATTTTGAGAAGA^ 6720 
6721 CTGTTTCTTGGGTCCTATTCCTTTAAATGAGAGGGTTTCGTTT^ 6780 
6781 TCATCTTTAGGATCCTCTTCTAGACGAGTAAAGTAATCCTCGTTACCAAGCAATGGTCTC 6840 
6841 ATCTTTTGAAGACAGGTCTTTTCCAAGTCCTAGTTCAGGCCAAAGCTT 6888 
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1 MFETEHHTLL PLLLLPSLLS LLLFLILLKR 
61 ATTLGDFMQQ HVSKYGKIYR SNLFGEPTIV 
121 GKWSMLVLVG DMHRDMRSIS LNFLSHARLR 
181 KKFTFNLMAK HIMSMDPGEE ETEQLKKEYV 
241 FIERKMEERK LDIKEEDQEE EEVKTEDEAE 
301 ILDLILSLLF AGHETSSVAI ALAIFFLQAC 
3 61 YKKMDFTQCV INETLRLGNV VRFLHRKALK 
421 QPNLFNPWRW QQQNNGASSS GSGSFSTWGN 
481 LKFNWELAED DQPFAFPFVD FPNGLPIRVS 



RNRKTRFNLP PGKSGWPFLG ETIGYLKPYT 
SADAGLNRFI LQNEGRLFEC SYPRSIGGIL 
TILLiCDVERH TLFVLDSWQQ NSIFSAQDEA 
TFMKGWSAP LNLPGTAYHK ALQSRATILK 
MSKSDHVRKQ RTDDDLLGWV LKHSNLSTEQ 
PKAVEELREE HLEIARAKKE LGESELNWDD 
DVRYKGYDIP SGWKVLPVIS AVHLDNSRYD 
NYMPFGGGPR LCAGSELAKL EMAVFIHHLV 
RIL 
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Atty Dkt No. 2225-0001 
Client No. 91004.003 

COMBINED DECLARATION AND POWER OF ATTORNEY 
FOR UTILITY PATENT APPLICATION 

AS A BELOW-NAMED INVENTOR, I HEREBY DECLARE THAT: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if more than one name is Ksted below) of the subject matter which is 
claimed and for which a patent is sought on the invention entitled: DWF4 POLYNUCLEOTIDES, 
POLYPEPTIDES AND USES THEREOF the specification of which 

X is attached hereto 
was filed on 

and assigned Serial No. and was amended on . 

I HAVE REVIEWED AND UNDERSTAND THE CONTENTS OF THE ABOVE-IDENTIFIED 
SPECIFICATION, INCLUDING THE CLAIMS, AS AMENDED BY ANY AMENDMENT 
REFERRED TO ABOVE. 

I acknowledge and understand that I am an individual who has a duty to disclose information which 
is material to the patentability of the claims of this application in accordance with Title 37, Code of 
Federal Regulations, §§ 1.56(a) and (b) which state: 

(a) A patent by its very nature is affected with a public interest. The public interest is 
best served, and the most effective patent examination occurs when, at the time an 
application is being examined, the Office is aware of and evaluates the teachings of 
all information material to patentability. Each individual associated with the filing 
and prosecution of a patent application has a duty of candor and good faith in dealing 
with the Office, which includes a duty to disclose to the Office all information 
known to that individual to be material to patentability as defined in this section. 
The duty to disclose information exists with respect to each pending claim until the 
claim is canceled or withdrawn from consideration, or the application becomes 
abandoned. Information material to the patentability of a claim that is canceled or 
withdrawn fi-om consideration need not be submitted if the information is not 
material to the patentability of any claim remaining under consideration in the 
application. There is no duty to submit information which is not material to the 
patentability of any existing claim. The duty to disclose all information known to be 
material to patentability is deemed to be satisfied if all information known to be 
material to patentability of any claim issued in a patent was cited by the Office or 
submitted to the Office in the maimer prescribed by §§ L97(b)-(d) and 1.98. 
However, no patent will be granted on an application in connection with which fi*aud 
on the Office was practiced or attempted or the duty of disclosure was violated 
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through bad faith or intentional misconduct. The Office encourages apphcants to 
carefully examine: 

(1) prior art cited in search reports of a foreign patent office in a counterpart 
application, and 

(2) the closest information over which individuals associated with the filing 
or prosecution of a patent application believe any pending claim patentably defines, 
to make sure that any material information contained therein is disclosed to the 
Office. 



(b) Under this section, information is material to patentabiUty when it is not 
cumulative to information already of record or being made of record in the 
application, and 

(1) It establishes, by itself or in combination with other information, a prima 
facie case of unpatentability of a claim; or 

(2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability rehed on by the Office, 

or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a 
conclusion that a claim is unpatentable under the preponderance of evidence, burden- 
of-proof standard, giving each term in the claim its broadest reasonable construction 
consistent with the specification, and before any consideration is given to evidence 
which may be submitted in an attempt to estabhsh a contrary conclusion of 
patentability. 

I do not know and do not believe this invention was ever known or used in the United States of 
America before my or our invention thereof, or patented or described in any printed publication in 
any country before my or our invention thereof or more than one year prior to said application. This 
invention was not in public use or on sale in the United States of America more than one year prior 
to this apphcation. This invention has not been patented or made the subject of an inventor's 
certificate issued before the date of this application in any country foreign to the United States of 
America on any application filed by me or my legal representatives or assigns more than six months 
prior to this application. 

I hereby claim priority benefits under Title 35, United States Code § 1 19(e)(1) of any United States 
provisional application(s) for patent as indicated below and have also identified below any 
application for patent on this invention having a filing date before that of the application for patent 
on which priority is claimed: 

Date of Filing Priority 
Application No. (dav/month/vear) Claimed 

60/1 19,657 1 1 February 1999 Yes ^ No _ 

60/1 19,658 1 1 February 1999 Yes X No _ 
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I hereby appoint the following attorneys and agents to prosecute that application and to transact all 
business in the Patent and Trademark Office connected therewith and to file, to prosecute and to 
transact all business in connection with all patent applications directed to the invention: 



Roberta L. Robins, Reg. No. 33,208 
Dahna S. Pasternak, Reg. No. 41,41 1 
Vandana Date, Reg. No. 38,675 
Gary R. Fabian, Ph.D., Reg, No. 33,875 

Address all correspondence to: Dahna S. Pasternak at 

ROBINS & ASSOCIATES 
90 Middlefield Road, Suite 200 
Menlo Park, CA 94025. 

Address all telephone calls to: Dahna S. Pasternak at (650) 325-7812. 

This appointment, including the right to delegate this appointment, shall also apply to the same 
extent to any proceedings established by the Patent Cooperation Treaty. 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are beUeved to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under § 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



Signature: Date 

Full Name of Inventor: Ricardo AZPIROZ 
Citizenship: Mexico 
Residence: Dallas, TX 

Post Office Address: 6634 Winding Rose Trail, Dallas, TX 75252 

Signature: Date 

Full Name of Inventor: Sunghwa CHOE 
Citizenship; Korea 
Residence: Tucson, AZ 

Post Office Address: 5855 North Court, No. 4211, Tucson, AZ 85750 

Signature: Date 

Full Name of Inventor: Kenneth A. FELDMANN 

Citizenship: US 

Residence: Newbury Park, CA 

Post Office Address: 3806 Calle Mazatlan, Newbury Park, CA 91320-1927 
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SEQUENCE LISTING 

<110> AZPIR02, Ricardo 
CHOE, Sunghwa 
FELDMAJSnsr, Kenneth 



<120> DWF4 POLYNUCLEOTIDES, POLYPEPTIDES AND USES THEREOF 



<130> 2225-0001 



<140> 
<141> 



<150> 60/119,657 
<151> 1999-02-11 



<150> 60/119,658 
<151> 1999-02-11 



<160> 18 



<170> Patentin Ver. 2.0 



<210> 1 

<211> 6888 

<212> DNA 

<213> Arabidopsis sp. 



<400> 1 

atgtgggtat tatattgttg ggttcggttt gagctacaat ataaatttcg tgtttctggt 60 
tattctgttc acatgatttg agtttggttc tcaatttgga ttccaagata attaaatatt 120 
aaaattcatt taaaatattt acaagtaatt aattatcttt acattgtatt gttataacaa 180 
aatatctatc tttggtatat gagaaaatat ggagtttgga atttataata ataaaggaaa 240 
taatcgattc catttggttg gattacacag ttaagttttt gtgtttcttt tgttatatgt 3 00 
atatgagtaa atcaaaaaga gtattgattg aagtgtaaac atatttcgtt atgaccccca 360 
aaaaaaaaaa aaaaacaaac aaacaaaccc cccccccgat atagtttttg gttctggatt 420 
aggtttattt gatcataatt acatgcatca tttctttgat tactatgaag attttcttac 480 
caattaaaat ttcgaattca tatctcttga ttattaaatt aaatacgagt gtgaatatcc 540 
gtttatcgat cactccaatc atgattatga ttcttgtgct aatccagcaa attattaaca 600 
agagtattga gaaaaaaccg aaaataagaa aagggaaaga gtagtgaccc atggagtatg 660 
tgaataatta tcaaagagaa taagagatga caaccaaaag gttgtggaat aatggtccct 720 
gccagctttc tctcacaatc aatatcgacc ctatttggat tttctggata ttcgttaaaa 780 
tttgcgataa cgattgtgaa aaatatttta tttgttagct gatctcaata ttatgttcca 840 
ggtatttgca taatcttctg tttaaagcat attttgtctt tctttttgtt tcgtttctct 900 
taactatata ttatcgcgga tatatgataa caatgatata tcacaaaaca attgtctggg 960 
accattttga ataaactttt tctcaaacat tacgggacac tggactcgac ccttaaaata 1020 
cgattttaca gcgtcactag ttgagattac tagcataaag cataaaggac ccgttcaagc 108 0 
tatttataca aagttacaaa ctgaatatag cttgaaatcc tttagaaaat tttggaatta 1140 
ccggttgtta tgtaaatata gatttagtgg taaacaaata tgttaatcaa ttagtggtca 12 00 
acatatacat aattccttac agaaaaaaca aacttaagag aagttaacat atccatatat 1260 
gggtatgcta tacctttcac gtatgctata ctagagacta aagaatagtt atgtgatgtc 132 0 
gataaatgaa attcacacgc gtggtaataa ttatgggacc gtatgttacg atcactgcaa 13 80 
atatcattct tggttggtca acaataaaaa caaaaacaag aaaaaaagaa aacgattttt 1440 
cttggattcc attcaatgat ctaaaatgca tagatctttt gggttacagt ttcgaagtcc 1500 
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tctacaagcg tgtaaccatc tgcaactatt aaattgcttt ctttaatgca tctttaacat 1560 
atttattgtt agttggaatt taataagagc gaacttgtaa cattacaata tttatattag 162 0 
atactagtat gtgattattc caaatacata ctttggatgt ttaaacttaa tcttgtttct 1680 
tcctacggta taaatattaa tcatcgaggt aaaaaaagtt ttgtcttatt ttcgcgatgc 1740 
atgaaggata aacctaatga ctttaatttt ttgaaaatgt aaccctttta ctcatagatt 180 0 
aattaccgta tgtttttgtt gccataatga cagcctctac aactgtgata gtcaattttt 1860 
tctgcaaata ttaaattagg aattcaatgc tactatcaat agaagaaaca gctgagtatt 192 0 
acattttaat ttaaagacaa aatttttgaa aaatgttata atttctaaca atattattaa 1980 
aatatgatgc ctataatgta tttcctatgt tcttaaaata ttttttttta tatttagtta 2040 
taaatacatt atgaaccaat aatagttggt gaattcaaat atctccatta atattttttg 2100 
aaatctacaa attattaata tttagtcaat aacaatgcat agaaagttcc aaaaaaaatt 216 0 
ttgttaacag aaacttccaa attttttttt tttatggaac aagaaataac agatagaaaa 222 0 
ctattttgtt gtggaatgga agtagtaata tacattaagc aaattttaaa aaattatata 22 8 0 
agcctatacg cgctcaaagt atgttatcta gtaggtgtaa ttaataatgc atggtgcgat 23 4 0 
tcagaattgg gacaacaatg aaaacggaat taaaatatta actttaaaat aaataaaaat 24 0 0 
ttgagtaaat gtgttttctg actattgagg ggcaaaaaaa agacaatgcc aaaagtctac 2460 
gggtttgact gtccagttcg gtaataatct aataactctg tctttgaccg cacgctcgtg 252 0 
taggggtcct tctgacattt tcactgttct acccctactc gtgagcccac ccttttccca 2580 
tatcctaagg gtaattttgg aaatcccaat ttaaaccgat tgagaccgta ccggacttcc 2 64 0 
tgggattctg ctggagcatt tatcaaaaat tattagcacg aatgggttta ttaatttaaa 2 70 0 
aactcacaac ttgatcagat aaaatttcat aaacactttt acgatggatt cgtacgatct 2760 
atctaatgac tttttttttt ctaccacggt ggatgaaagt tatagtacta ttagccagag 2 82 0 
acaattgatt atagatatat ccattaatcc atgatattta tgatataaat agctgttaaa 2 880 
ctatttcagc atcgcagctt tctgcaactt ttgtttttaa tttaagagtt taataaataa 2940 
aagtattaaa aggagcataa cgaggcaaca aaagtaatga acacggagaa acaaaagcca 3 0 00 
tgaagctcat tggttagttt aagcttaata agaagatttt attaaatttt aatgacgatg 3 060 
ataacaatta tattttctga cttctttaaa accccctctt acaaacagaa gctccctttt 3120 
tcagtagaag tccgattccc aatcttaaag acaaagccat tagaaagaga aagtgagtga 318 0 
gagagagaga gaaactagct ccatgttcga aacagagcat catactctct tacctcttct 3240 
tcttctccca tcgcttttgt ctcttcttct cttcttgatt ctcttgaaga gaagaaatag 3300 
aaaaaccaga ttcaatctac ctccgggtaa atccggttgg ccatttcttg gtgaaaccat 3360 
cggttatctt aaaccgtaca ccgccacaac actcggtgac ttcatgcaac aacatgtctc 3420 
caagtaaaca acaacatctt ccaaaaactc aaaaaaataa atcctctgtt tttgaaattt 3480 
gactaatgtt gtttatttta caggtatggt aagatatata gatcgaactt gtttggagaa 3540 
ccaacgatcg tatcagctga tgctggactt aatagattca tattacaaaa cgaaggaagg 3600 
ctctttgaat gtagttatcc tagaagtata ggtgggattc ttgggaaatg gtcgatgctt 3660 
gttcttgttg gtgacatgca tagagatatg agaagtatct cgcttaactt cttaagtcac 3720 
gcacgtctta gaactattct acttaaagat gttgagagac atactttgtt tgttcttgat 3780 
tcttggcaac aaaactctat tttctctgct caagacgagg ccaaaaaggt ttttattttt 3840 
atcttttatt ttgctaaatt tttttgttta tgaatcttta gagtttctaa cttttttttt 3900 
tttaattgaa cagtttacgt ttaatctaat ggcgaagcat ataatgagta tggatcctgg 3 960 
agaagaagaa acagagcaat taaagaaaga gtatgtaact ttcatgaaag gagttgtctc 4 02 0 
tgctcctcta aatctaccag gaactgctta tcataaagct cttcaggtac atttattttt 4080 
ttttgctgta aagtcacaaa ctctcattat aggtttttaa ttttatttta tgtgttaaat 4140 
aaaatatcta aaatggttgt gtagtcacga gcaacgatat tgaagttcat tgagaggaaa 42 0 0 
atggaagaga gaaaattgga tatcaaggaa gaagatcaag aagaagaaga agtgaaaaca 42 60 
gaggatgaag cagagatgag taagagtgat catgttagga aacaaagaac agacgatgat 432 0 
cttttgggat gggttttgaa acattcgaat ttatcgacgg agcaaattct cgatctcatt 43 80 
cttagtttgt tatttgccgg acatgagact tcttctgtag ccattgctct cgctatcttc 4440 
ttcttgcaag cttgccctaa agccgttgaa gagcttaggg taagataatt ataacagcac 45 0 0 
aagttaatta ctaccaaatt gttacgtatt atataagtta ttatagaatt attctattag 4560 
aatatacgat gaaaaaagta tgtatattta attgtcacta attttatgtt tattgattta 4620 
tacttttgaa ggaagagcat cttgagatcg cgagggccaa gaaggaacta ggagagtcag 4680 
aattaaattg ggatgattac aagaaaatgg actttactca atgtgtatgt tactatcatt 474 0 
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ctcattattt attctatgtt catatgattt atgatgaaac caaaattatt gatttttttt 4800 
ttggtgtgtg tgaaggttat aaatgaaact cttcgattgg gaaatgtagt taggtttttg 4860 
catcgcaaag cactcaaaga tgttcggtac aaaggtaaaa ctttacgtac aaaattttta 492 0 
aataatgaaa tccggaatat tgaaatctta ttggatgaaa aatattaaaa taatttacat 4980 
ttcttaatgt tggaaaaaag gatacgatat ccctagtggg tggaaagtgt taccggtgat 5040 
ctcagccgta catttggata attctcgtta tgaccaacct aatctcttta atccttggag 5100 
atggcaacag gtaaataaaa agtttctctc gttaactatc gaaaattagt gtatagtttt 5160 
ttcatctatt gcatgaatag atacgtccta cgtgatttac ctatctatag atactatacg 5220 
agaactatta atctggcaaa aactttttat tattattatc tttcaagtta gatcttaaca 5280 
cgtcatggat cattgatcac atgaaagcat ataaattaaa aataagagag agaaagagac 534 0 
gtgttggtgt aagtgtacgt gaagacaatt aattagtagg atggtatgtc tttaatgacg 54 0 0 
taggagctgc ctaaatattc ttataatcgt gaccgttgat ttattattag tcacggcttt 5460 
gatacaattt aagatttgac ggacgatggt accacggctt tgacggatct cacacgcccg 552 0 
atgacttgta cgtgcgttag attctgccac gttgactggt tttaatactt agatttataa 5580 
ctctattaat tataacaact atcaaatcgg cgaattagag aaatatacta tatagtatta 5640 
ttatgattat tatgagataa tactttatga aataagataa taatggtagt catgatgtta 5700 
tagtgagtgg ggaaggtaag aggtggtgag agatgattaa tgaccccacg tggtgtggtg 5760 
ccaacaagca cgtgttcttc ttcctttttt cttcccaact tctttttttg ggggtttatt 5820 
gtgatttata aaatcggttt gtcgtttttt tttgtgacga gcagcaaaac aacggagcgt 5880 
catcgtcagg aagtggtagt ttttcgacgt ggggaaacaa ctacatgccg tttggaggag 5 94 0 
ggccaaggct atgtgctggt tcagagctag ccaagttaga aatggcagtg tttattcatc 6000 
atctagttct taaattcaat tgggaattag cagaagatga tcaaccattt gcttttcctt 6060 
ttgttgattt tcctaacggt ttgcctatta gggtttctcg tattctgtaa aaaaaaaaaa 612 0 
agatgaaagt atttttattc tcttcttttt tttttgataa ttttaaatca ttttttttgc 6180 
ccaatgatat ataaaaattt ggataaataa tattattgga tattcgtttt ttagttcggg 624 0 
tttgagaaaa gggtttcgac tttcgaaagt ggacgatgta tatagattgg gagctaggtt 6300 
gagtctttgg acatttgtat tggatgttgt tgattattag tgtcgacact attaaacctt 63 60 
aaatgggctt tctataaggc ccaattatat tacgattata acaaagtgac aacttttact 642 0 
tcgtttttga tccgaagcaa taacaaattg tcaaatacca aacacaagaa ttatgtaaac 64 8 0 
actcgtgtgt gtctagtggg aaatcattgg gctggagact gaacatcaga acacaagaaa 6540 
cctgtcaatt atggatacac ctcctatgac ggtttccaaa ctttatcttg attcttatcg 6600 
tgttacattg acacaaagag ttaggtgtca aaaggactaa atgaataaca atagctctca 6660 
ggataagaag gttcataaaa tggtttcttt attttgagaa gaaagagaga ggagctttta 672 0 
ctgtttcttg ggtcctattc ctttaaatga gagggtttcg tttttacttc ttctatctca 6780 
tcatctttag gatcctcttc tagacgagta aagtaatcct cgttaccaag caatggtctc 6840 
atcttttgaa gacaggtctt ttccaagtcc tagttcaggc caaagctt 6888 

<210> 2 
<211> 513 
<212> PRT 

<213> Arabidopsis sp. 
<400> 2 

Met Phe Glu Thr Glu His His Thr Leu Leu Pro Leu Leu Leu Leu Pro 
15 10 15 

Ser Leu Leu Ser Leu Leu Leu Phe Leu lie Leu Leu Lys Arg Arg Asn 
20 25 30 

Arg Lys Thr Arg Phe Asn Leu Pro Pro Gly Lys Ser Gly Trp Pro Phe 
35 40 45 

Leu Gly Glu Thr lie Gly Tyr Leu Lys Pro Tyr Thr Ala Thr Thr Leu 
50 55 60 
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Gly Asp Phe Met Gin Gin His Val Ser Lys Tyr Gly Lys lie Tyr Arg 
65 70 75 80 

Ser Asn Leu Phe Gly Glu Pro Thr He Val Ser Ala Asp Ala Gly Leu 
85 90 95 

Asn Arg Phe He Leu Gin Asn Glu Gly Arg Leu Phe Glu Cys Ser Tyr 
100 105 110 

Pro Arg Ser He Gly Gly He Leu Gly Lys Trp Ser Met Leu Val Leu 
115 120 125 

Val Gly Asp Met His Arg Asp Met Arg Ser He Ser Leu Asn Phe Leu 
130 135 140 

Ser His Ala Arg Leu Arg Thr He Leu Leu Lys Asp Val Glu Arg His 
145 150 155 160 

Thr Leu Phe Val Leu Asp Ser Trp Gin Gin Asn Ser He Phe Ser Ala 
165 170 175 

Gin Asp Glu Ala Lys Lys Phe Thr Phe Asn Leu Met Ala Lys His He 
180 185 190 

Met Ser Met Asp Pro Gly Glu Glu Glu Thr Glu Gin Leu Lys Lys Glu 
195 200 205 

Tyr Val Thr Phe Met Lys Gly Val Val Ser Ala Pro Leu Asn Leu Pro 
210 215 220 

Gly Thr Ala Tyr His Lys Ala Leu Gin Ser Arg Ala Thr He Leu Lys 
225 230 235 240 

Phe He Glu Arg Lys Met Glu Glu Arg Lys Leu Asp He Lys Glu Glu 
245 250 255 

Asp Gin Glu Glu Glu Glu Val Lys Thr Glu Asp Glu Ala Glu Met Ser 
260 265 270 

Lys Ser Asp His Val Arg Lys Gin Arg Thr Asp Asp Asp Leu Leu Gly 
275 280 285 

Trp Val Leu Lys His Ser Asn Leu Ser Thr Glu Gin He Leu Asp Leu 
290 295 300 

He Leu Ser Leu Leu Phe Ala Gly His Glu Thr Ser Ser Val Ala He 
305 310 315 320 

Ala Leu Ala He Phe Phe Leu Gin Ala Cys Pro Lys Ala Val Glu Glu 
325 330 335 

Leu Arg Glu Glu His Leu Glu He Ala Arg Ala Lys Lys Glu Leu Gly 
340 345 350 
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Glu Ser Glu Leu 
355 

Cys Val He Asn 

370 

His Arg Lys Ala 
385 

Ser Gly Trp Lys 



Ser Arg Tyr Asp 
420 

Gin Asn Asn Gly 
435 

Gly Asn Asn Tyr 
450 

Ser Glu Leu Ala 
465 

Leu Lys Phe Asn 



Pro Phe Val Asp 
500 



Asn Trp Asp Asp 
360 

Glu Thr Leu Arg 

375 

Leu Lys Asp Val 
390 

Val Leu Pro Val 
405 

Gin Pro Asn Leu 



Ala Ser Ser Ser 
440 

Met Pro Phe Gly 
455 

Lys Leu Glu Met 
470 

Trp Glu Leu Ala 
485 

Phe Pro Asn Gly 



Tyr Lys Lys Met 



Leu Gly Asn Val 
380 

Arg Tyr Lys Gly 
395 

lie Ser Ala Val 
410 

Phe Asn Pro Trp 
425 

Gly Ser Gly Ser 



Gly Gly Pro Arg 
460 

Ala Val Phe He 
475 

Glu Asp Asp Gin 
490 

Leu Pro He Arg 
505 



Asp Phe Thr Gin 
365 

Val Arg Phe Leu 



Tyr Asp He Pro 
400 

His Leu Asp Asn 
415 

Arg Trp Gin Gin 
430 

Phe Ser Thr Trp 
445 

Leu Cys Ala Gly 



His His Leu Val 
480 

Pro Phe Ala Phe 
495 

Val Ser Arg He 
510 



Leu 



<210> 3 
<211> 24 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D40VERF 
<400> 3 

atgttcgaaa cagagcatca tact 24 

<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4PRM 
<400> 4 
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cctcgatcaa agagagagag a 



<210> 5 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4RTF 
<400> 5 

ttcttggtga aaccatcggt tatcttaaa 

<210> 6 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4RTR 
<400> 6 

tatgataagc agttcctggt agattt 

<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4F1 
<400> 7 

cgaggcaaca aaagtaatga a 

<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4R1 
<400> 8 

gttagaaact ctaaagattc a 

<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4F2 



<400> 14 

gcgaagcata taatgagtat ggat 

<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4R5 
<400> 15 

gttggtcata acgagaatta tccaaa 

<210> 16 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4XLINIT 
<400> 16 

taggatccag ctagtttctc tctctctct 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer T7 
<400> 17 

taatacgact cactataggg 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
D40VERFA 

<400> 18 

gaattctaga atgttcgaaa cagagcatca ta 
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<400> 9 

gattcttggc aacaaaactc tat 



23 



<210> 10 

<211> 20 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4R2 

<400> 10 

ccgaacatct ttgagtgctt 20 

<210> 11 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4F3 



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4R3 
<400> 12 

ggtttaatag tgtcgacact aata 24 

<210> 13 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4F4 



<210> 14 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer D4F5 



<400> 11 

gtgtgaaggt tataaatgaa actctt 
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<400> 13 

ccgatgactt gtacgtgcgt ta 
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